Research on the Content Structuring Indexing and Split Techniques Based on Word Processor in MS Word
Proposes three technologies that involved in content structural indexing and splitting of.docx document in CoSIS system:indexing content structure automatically; transforming formula object to standard MathML code; semantic indexing over.docx document.The experimental results conducts on CoSIS reveal that these technologies work well in practice.
Word ProcessorContent StructuringStructured IndexingConversion FormulaMathMLSemantic Indexing