Jacob et al., 2003
| Publication | Publication Date | Title |
|---|---|---|
| Yoshikawa et al. | XRel: a path-based approach to storage and retrieval of XML documents using relational databases | |
| Bex et al. | Inferring XML schema definitions from XML data | |
| US6738767B1 (en) | System and method for discovering schematic structure in hypertext documents | |
| US7693848B2 (en) | Method and apparatus for structuring documents based on layout, content and collection | |
| US7912705B2 (en) | System and method for extracting information from text using text annotation and fact extraction | |
| US8935267B2 (en) | Apparatus and method for executing different query language queries on tree structured data using pre-computed indices of selective document paths | |
| Tao et al. | Automatic hidden-web table interpretation, conceptualization, and semantic annotation | |
| Ni et al. | GLASS: A graphical query language for semi-structured data | |
| US7774699B2 (en) | Parallel data transformation | |
| Zhang et al. | Adding valid time to XPath | |
| Jacob et al. | Cx-diff: A change detection algorithm for xml content and change presentation issues for webvigil | |
| Jacob et al. | CX-DIFF: a change detection algorithm for XML content and change visualization for WebVigiL | |
| Pandrangi et al. | WebVigiL: user profile-based change detection for HTML/XML documents | |
| Pembe et al. | A tree-based learning approach for document structure analysis and its application to web search | |
| Lam et al. | A method for web information extraction | |
| Choi | TPEMatcher: A tool for searching in parsed text corpora | |
| Khrouf et al. | A Textual Warehouse Approach: A Web Data Repository | |
| Pembe et al. | A Tree Learning Approach to Web Document Sectional Hierarchy Extraction. | |
| Sasaki et al. | Declarations of relations, differences and transformations between theory-specific treebanks: A new methodology | |
| Gançarski et al. | Attribute grammar-based interactive system to retrieve information from XML documents | |
| Burget | Information Extraction from HTML Documents Based on Logical Document Structure | |
| Lehtonen | Preparing heterogeneous XML for full-text search | |
| Badr et al. | Transformation rules from semi-structured XML documents to database model | |
| Di Iorio et al. | Handling markup overlaps using OWL | |
| Medina et al. | Describing document hierarchies by using markup languages |