Part of the book series: Studies in Fuzziness and Soft Computing (STUDFUZZ, volume 111)
Abstract
In this paper we present a fuzzy model for representing structured Web documents in an Information Retrieval System, together with a flexible query language for expressing soft selection conditions. The documents' content is organized into thematic (topical) sections in which the index terms play a distinct role. The proposed document representation adapts to the user, who can indicate the preferred sections of documents, i.e. those they judge to carry the most interesting information, and can linguistically quantify the number of sections that determine the overall potential interest of a document.
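The abstract does not give the aggregation formula. As a minimal sketch, assuming that per-section relevance scores of an index term are combined with an Ordered Weighted Averaging (OWA) operator whose weights are derived from a fuzzy linguistic quantifier, the combination of user-preferred sections and a quantified number of sections might look as follows. All function names, the weighting of scores by multiplication with section importance, and the shape of the `most` quantifier are illustrative assumptions, not the authors' definitions.

```python
from typing import Callable, Dict, List

Quantifier = Callable[[float], float]  # non-decreasing map from [0, 1] to [0, 1]


def at_least_one(r: float) -> float:
    """Quantifier 'at least one': satisfied as soon as any proportion is covered."""
    return 1.0 if r > 0.0 else 0.0


def all_sections(r: float) -> float:
    """Quantifier 'all': satisfied only when the full proportion is covered."""
    return 1.0 if r >= 1.0 else 0.0


def most(r: float) -> float:
    """Hypothetical shape for 'most': 0 up to 0.3, 1 from 0.8, linear in between."""
    if r <= 0.3:
        return 0.0
    if r >= 0.8:
        return 1.0
    return (r - 0.3) / 0.5


def owa_weights(n: int, quantifier: Quantifier) -> List[float]:
    """OWA weights from a quantifier Q: w_i = Q(i/n) - Q((i-1)/n), i = 1..n."""
    return [quantifier(i / n) - quantifier((i - 1) / n) for i in range(1, n + 1)]


def document_significance(
    section_scores: Dict[str, float],
    section_importance: Dict[str, float],
    quantifier: Quantifier,
) -> float:
    """Aggregate per-section scores of one index term into a document-level score.

    section_scores: relevance of the term in each section, in [0, 1].
    section_importance: user preference for each section, in [0, 1]
        (sections the user did not rate default to 1.0; an assumption).
    """
    # Scale each section score by the user's preference (one simple choice).
    weighted = [
        section_importance.get(name, 1.0) * score
        for name, score in section_scores.items()
    ]
    # OWA applies its weights to the scores sorted in descending order.
    ordered = sorted(weighted, reverse=True)
    weights = owa_weights(len(ordered), quantifier)
    return sum(w * s for w, s in zip(weights, ordered))


if __name__ == "__main__":
    scores = {"title": 0.9, "abstract": 0.5, "body": 0.2}
    prefs = {"title": 1.0, "abstract": 1.0, "body": 1.0}
    for name, q in [("at least one", at_least_one), ("most", most), ("all", all_sections)]:
        print(name, document_significance(scores, prefs, q))
```

With the quantifier "at least one" the aggregation reduces to the maximum of the weighted section scores, and with "all" it reduces to the minimum; intermediate quantifiers such as "most" fall between these two extremes.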
Author information
Authors and Affiliations
Istituto per le Tecnologie Informatiche Multimediali, CNR, Milano, Italy
Gloria Bordogna & Gabriella Pasi
- Gloria Bordogna
- Gabriella Pasi
Editor information
Editors and Affiliations
Institute of Computer Science, Technical University of Lodz, ul. Sterlinga 16/18, 90-217, Lodz, Poland
Piotr S. Szczepaniak
Systems Research Institute, Polish Academy of Sciences, ul. Newelska 6, 01-447, Warsaw, Poland
Piotr S. Szczepaniak & Janusz Kacprzyk
Facultad de Informática, Universidad Politécnica de Madrid, Campus de Montegancedo, 28660, Madrid, Spain
Javier Segovia
Computer Science Division, Department of Electrical Engineering and Computer Sciences, University of California, 94720-1776, Berkeley, CA, USA
Lotfi A. Zadeh
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Bordogna, G., Pasi, G. (2003). Flexible Representation and Retrieval of WEB Documents. In: Szczepaniak, P.S., Segovia, J., Kacprzyk, J., Zadeh, L.A. (eds) Intelligent Exploration of the Web. Studies in Fuzziness and Soft Computing, vol 111. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1772-0_3
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-2519-0
Online ISBN: 978-3-7908-1772-0
eBook Packages: Springer Book Archive