Abstract
With the explosion of information on the Web, information that is made available from websites is generally overwhelming to users surfing the sites. The majority of the users who are facing this information overloading problem are the ordinary home users who do not have much technical knowledge. It is thus important to allow these users to easily create personalized views of websites such that they only see what they want in the way they prefer. In this paper, we propose the concept of a personalized Web view to cater to this requirement. Underlying this concept is a data model that represents websites from the logical point of view and a declarative langauge that transforms logical views into personalized Web views. To empower ordinary users with the ability to build their own personalized Web views, we have designed and implemented a software system, known as Wiccap. This system includes a wizard to help users create data models that map physical websites into logical views. It also has an information extraction agent that allows users to instantiate their personalized Web views of the target websites by transforming from logical views previously defined. In order to increase the fun and flexibility of using this software, a flexible presentation toolkit has been designed to present the information in a manner that is programmable by the users.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Adelberg, B.: NoDoSE – A Tool for Semi-Automatically Extracting Semi-Structured Data from Text Documents. In: Proceedings ACM SIGMOD International Conference on Management of Data (SIGMOD 1998), Seattle, Washington, USA, pp. 283–294 (1998)
Baumgartner, R., Flesca, S., Gottlob, G.: Visual Web Information Extraction with Lixto. In: Proceedings of 27th International Conference on Very Large Data Bases (VLDB 2001), Roma, Italy, pp. 119–128 (2001)
Crescenzi, V., Mecca, G., Merialdo, P.: RoadRunner: Towards Automatic Data Extraction from Large Web Sites. In: Proceedings of 27th International Conference on Very Large Data Bases (VLDB 2001), Roma, Italy, pp. 109–118 (2001)
Embley, D.W., Campbell, D.M., Jiang, Y.S., Liddle, S.W., Lonsdale, D.W., Ng, Y.K., Smith, R.D.: Conceptual-model-based data extraction from multiple-record Web pages. Data & Knowledge Engineering 31, 227–251 (1999)
Liu, L., Pu, C., Han, W.: XWRAP: An XML-Enabled Wrapper Construction System for Web Information Sources. In: Proceedings of the 16th International Conference on Data Engineering (ICDE 2000), San Diego, California, USA, pp. 611–621 (2000)
Liu, Z., Ng, W.K., Lim, E.P., Li, F.: Towards Building Logical Views of Websites. To Appear in Data & Knowledge Engineering (2004)
Liu, Z., Ng, W.K., Lim, E.P., Li, F., Huang, Y.: Unloading Unwanted Information: From Physical Websites to Personalized Web Views. Technical report, Centre for Advanced Information Systems, Nanyang Technological University, Singapore (2003)
Sahuguet, A., Azavant, F.: Building intelligent Web applications using lightweight wrappers. Data & Knowledge Engineering 36, 283–316 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, Z., Ng, W.K., Lim, EP., Huang, Y., Li, F. (2004). Unloading Unwanted Information: From Physical Websites to Personalized Web Views. In: Yu, J.X., Lin, X., Lu, H., Zhang, Y. (eds) Advanced Web Technologies and Applications. APWeb 2004. Lecture Notes in Computer Science, vol 3007. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24655-8_12
Download citation
DOI: https://doi.org/10.1007/978-3-540-24655-8_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21371-0
Online ISBN: 978-3-540-24655-8
eBook Packages: Springer Book Archive