Embedding Structure into HTML for More Precise Retrieval of Information, A Novel XML Schema

Authors

  • Anwar Alhenshiri
  • Zainab Afat
  • Hoda Badesh

DOI:

https://doi.org/10.36602/ijeit.v7i1.235

Keywords:

information retrieval, databases, HTML, XML, schema, structured retrieval, query, search, relevancy

Abstract

This paper presents the core of a universal schema for transforming any HTML document into XML format. The objective is to embed a sense of structure into textual documents before information is retrieved. The structure is derived from the HTML document according to the schema and expressed as an XML document. The resulting structure helps identify levels of significance within the HTML page. Including this hidden structure of the text document in the computation of relevancy during retrieval can yield more relevant results. A preliminary study indicates that the approach is promising and may succeed in larger studies.
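
The idea described in the abstract can be illustrated with a minimal sketch. It is not the schema proposed in the paper: the tag-to-level mapping (SIGNIFICANCE), the element and attribute names (document, segment, source, level), and the 1/level weighting are all hypothetical choices made only to show how structure taken from HTML tags could be written out as XML and then used when computing relevancy.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

# Hypothetical mapping from HTML tags to significance levels (1 = most
# significant). Illustrative assumption only, not the paper's schema.
SIGNIFICANCE = {"title": 1, "h1": 2, "h2": 3, "h3": 4, "p": 5, "li": 5}


class StructureExtractor(HTMLParser):
    """Collects text from selected HTML tags into a flat XML tree."""

    def __init__(self):
        super().__init__()
        self.root = ET.Element("document")
        self._current = None  # XML element currently receiving text

    def handle_starttag(self, tag, attrs):
        if tag in SIGNIFICANCE:
            self._current = ET.SubElement(
                self.root,
                "segment",
                {"source": tag, "level": str(SIGNIFICANCE[tag])},
            )
            self._current.text = ""

    def handle_endtag(self, tag):
        # Close the open segment only when its own tag ends, so inline
        # tags such as <b> inside a <p> do not interrupt it.
        if self._current is not None and self._current.get("source") == tag:
            self._current = None

    def handle_data(self, data):
        if self._current is not None:
            self._current.text += data


def html_to_xml(html):
    """Return an XML element tree holding the extracted segments."""
    parser = StructureExtractor()
    parser.feed(html)
    parser.close()
    return parser.root


def weighted_score(root, term):
    """Sum whitespace-delimited term matches, each weighted by 1 / level."""
    term = term.lower()
    total = 0.0
    for seg in root.iter("segment"):
        count = seg.text.lower().split().count(term)
        total += count / int(seg.get("level"))
    return total
```

Calling html_to_xml(page) on an HTML string and passing the result to ET.tostring(..., encoding="unicode") shows the generated XML. With weighted_score, a query term found in the title contributes more to the score than the same term found in a paragraph, which is one simple way structure could enter a relevancy computation.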

Published

2024-06-13

How to Cite

Embedding Structure into HTML for More Precise Retrieval of Information, A Novel XML Schema. (2024). The International Journal of Engineering & Information Technology (IJEIT), 7(1), 50-56. https://doi.org/10.36602/ijeit.v7i1.235
