Embedding Structure into HTML for More Precise Retrieval of Information, A Novel XML Schema

Main Article Content

Anwar Alhenshiri
Zainab Afat
Hoda Badesh

Abstract

This paper presents the core of a universal schema to transform each HTML document into XML format. The objective is to embed a sense of structure into textual documents prior to retrieving information. The structure is obtained from the HTML document based on the schema and applied in the form of an XML document. The resulting structure helps with identifying levels of significance in the HTML page. More relevant results can be obtained by including the hidden structure of the text document in the computation of relevancy during retrieval. The preliminary study indicates potential success with larger studies.  

Article Details

How to Cite
Alhenshiri, A., Afat, Z., & Badesh, H. (2024). Embedding Structure into HTML for More Precise Retrieval of Information, A Novel XML Schema. The International Journal of Engineering & Information Technology (IJEIT), 7(1), 50–56. https://doi.org/10.36602/ijeit.v7i1.235
Section
Artical