Skip to main content
  • Book
  • © 2011

Learning Structure and Schemas from Documents

  • Presents State-Of-The-Art Methods for Structure Learning and Schema
  • Inference Case Studies and Best Practices from Real Large Scale Digital Libraries, Repositories and Corpora
  • Written by Leading Experts in the Field

Part of the book series: Studies in Computational Intelligence (SCI, volume 375)

Buy it now

Buying options

eBook USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (19 chapters)

  1. Front Matter

  2. Administrative Document Analysis and Structure

    • Abdel Belaïd, Vincent Poulain D’Andecy, Hatem Hamza, Yolande Belaïd
    Pages 51-71
  3. Automatic Document Layout Analysis through Relational Machine Learning

    • Stefano Ferilli, Teresa M. A. Basile, Nicola Di Mauro, Floriana Esposito
    Pages 73-96
  4. Dataspaces: Where Structure and Schema Meet

    • Maurizio Atzori, Nicoletta Dessì
    Pages 97-119
  5. Transductive Learning of Logical Structures from Document Images

    • Michelangelo Ceci, Corrado Loglisci, Donato Malerba
    Pages 121-142
  6. Progressive Filtering on the Web: The Press Reviews Case Study

    • Andrea Addis, Giuliano Armano, Eloisa Vargiu
    Pages 143-163
  7. A Hybrid Binarization Technique for Document Images

    • Vavilis Sokratis, Ergina Kavallieratou, Roberto Paredes, Kostas Sotiropoulos
    Pages 165-179
  8. Digital Libraries and Document Image Retrieval Techniques: A Survey

    • Simone Marinai, Beatrice Miotti, Giovanni Soda
    Pages 181-204
  9. Mining Biomedical Text towards Building a Quantitative Food-Disease-Gene Network

    • Hui Yang, Rajesh Swaminathan, Abhishek Sharma, Vilas Ketkar, Jason D‘Silva
    Pages 205-225
  10. Mining Tinnitus Data Based on Clustering and New Temporal Features

    • Xin Zhang, Pamela Thompson, Zbigniew W. RaÅ›, Pawel Jastreboff
    Pages 227-245
  11. MANENT: An Infrastructure for Integrating, Structuring and Searching Digital Libraries

    • Angela Locoro, Daniele Grignani, Viviana Mascardi
    Pages 315-341
  12. Low-Level Document Image Analysis and Description: From Appearance to Structure

    • Emanuele Salerno, Pasquale Savino, Anna Tonazzini
    Pages 343-367
  13. Model Learning from Published Aggregated Data

    • Janusz Wojtusiak, Ancha Baranova
    Pages 369-384
  14. Data De-duplication: A Review

    • Gianni Costa, Alfredo Cuzzocrea, Giuseppe Manco, Riccardo Ortale
    Pages 385-412
  15. A Survey on Integrating Data in Bioinformatics

    • Andrea Manconi, Patricia Rodriguez-Tomé
    Pages 413-432

About this book

The rapidly growing volume of available digital documents of various formats and the possibility to access these through Internet-based technologies, have led to the necessity to develop solid methods to properly organize and structure documents in large digital libraries and repositories. Due to the extremely large volumes of documents and to their unstructured form, most of the research efforts in this direction are dedicated to automatically infer structure and schemas that can help to better organize huge collections of documents and data.

 

This book covers the latest advances in structure inference in heterogeneous collections of documents and data. The book brings a comprehensive view of the state-of-the-art in the area, presents some lessons learned and identifies new research issues, challenges and opportunities for further research agenda and developments.  The selected chapters cover a broad range of research issues, from theoretical approaches to case studies and best practices in the field.

 

Researcher, software developers, practitioners and students interested in the field of learning structure and schemas from documents will find the comprehensive coverage of this book useful for their research, academic, development and practice activity.

Editors and Affiliations

  • University of New York Tirana, Tirana, Albania

    Marenglen Biba

  • Technical University of Catalonia, Barcelona, Spain

    Fatos Xhafa

Bibliographic Information

  • Book Title: Learning Structure and Schemas from Documents

  • Editors: Marenglen Biba, Fatos Xhafa

  • Series Title: Studies in Computational Intelligence

  • DOI: https://doi.org/10.1007/978-3-642-22913-8

  • Publisher: Springer Berlin, Heidelberg

  • eBook Packages: Engineering, Engineering (R0)

  • Copyright Information: Springer-Verlag GmbH Berlin Heidelberg 2011

  • Hardcover ISBN: 978-3-642-22912-1Published: 03 September 2011

  • Softcover ISBN: 978-3-662-50671-4Published: 23 August 2016

  • eBook ISBN: 978-3-642-22913-8Published: 25 September 2011

  • Series ISSN: 1860-949X

  • Series E-ISSN: 1860-9503

  • Edition Number: 1

  • Number of Pages: XVIII, 441

  • Topics: Computational Intelligence, Artificial Intelligence

Buy it now

Buying options

eBook USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access