Embedding Structure into HTML for More Precise Retrieval of Information, A Novel XML Schema

محتوى المقالة الرئيسي

Anwar Alhenshiri
Zainab Afat
Hoda Badesh

الملخص

This paper presents the core of a universal schema to transform each HTML document into XML format. The objective is to embed a sense of structure into textual documents prior to retrieving information. The structure is obtained from the HTML document based on the schema and applied in the form of an XML document. The resulting structure helps with identifying levels of significance in the HTML page. More relevant results can be obtained by including the hidden structure of the text document in the computation of relevancy during retrieval. The preliminary study indicates potential success with larger studies.  

تفاصيل المقالة

كيفية الاقتباس
Alhenshiri, A., Afat, Z., & Badesh, H. (2024). Embedding Structure into HTML for More Precise Retrieval of Information, A Novel XML Schema. The International Journal of Engineering & Information Technology (IJEIT), 7(1), 50–56. https://doi.org/10.36602/ijeit.v7i1.235
القسم
المقالات