|
计算机科学 2004
A Technique for Information Reconstruction of Web Pages Based on ETG
|
Abstract:
Based on Extended Tag Graph (ETG)1], a new technique for information extraction and reconstruction of Web pages has been presented in the paper. We have introduced the concepts of ETG Operations and ETG Reconstruction, and put forward a Tag Structured Query Language (TagSQL)in the design of user interface. By using the language given in the similar form as SQL, a user can describe conveniently the operations for the information extraction and reconstruction of Web pages.