Management of archival materials with Linked Data and federated queries
DOI:
https://doi.org/10.3989/redc.2016.3.1299Keywords:
Linked Data, ontologies, archives, repositories, information services, Drupal, federated queries, DSpaceAbstract
In this paper the major technologies of the Semantic Web which may be useful for archives management are summarized. Several local and international projects that generate ontologies from standardized descriptions based on ISAD-G are examined. It is also discussed LIAM (Linked Archival Metadata), that facilitates the transformation of archive records into RFD (Resource Description Framework) format. Furthermore, we analyze how Linked Data enables interoperability between information systems and faceted search of OWL (Ontology Web Language), SKOS (Simple Knowledge Organization System) and Dublin Core records. The authors propose the use of a CMS (Content Management System) compatible with SIOC (Semantically-Interlinked Online Communities) and OAI-PMH (Open Archives Initiative - Protocol for Metadata Harvesting) for archive records to improve the exchange and retrieval of information. We specifically describe the technologies used for developing CoroArchivo, system assessed by an experiment that automatically generates ontologies from ISAD-G records stored in DSpace. The evaluation tool lets users perform federated queries based on the OWL vocabulary disjointness and equivalent classes.
Downloads
References
Addis, M.; Allasia, W.; Bailer, W. (2010). 100 Million Hours of Audiovisual Content: Digital Preservation and Access in the PrestoPRIME Project. First International Digital Preservation Interoperability Framework (DPIF) Symposium. Dresden, Germany: ACM.
Alonso-Sierra, L. E., Ortiz-Mu-oz, E.; Hidalgo-Delgado, Y. (2012) Los sistemas de gestión de contenidos en el ámbito de la web semántica: una breve revisión. Serie científica de la Universidad de las Ciencias, 5, 1-9.
Baker, T. (2012). Libraries, languages of description, and linked data: a Dublin Core perspective. Library Hi Tech, 30, pp. 116 - 133. http://dx.doi.org/10.1108/07378831211213256
Berners-Lee, T.; Hendler, J. y Ora Lassila (2001). The Semantic Web. Scientific American, vol. 284 (5), 28-37. http://dx.doi.org/10.1038/scientificamerican0501-34
Bizer, C.; Seaborne, A. (2004). D2RQ - Treating Non-RDF Databases as Virtual RDF Graphs. 3rd International Semantic Web Conference. Hiroshima, Japan: Springer.
Brickley, D.; Miller, L. (2010). FOAF Vocabulary Specification.
Byron, A., Berry, A.; De-Bondt, B. (2012). Using Drupal: choosing and configuring modules to build dynamic websites. Sebastopol (California), O'Reilly Media.
Coppens, S.; Mannes, E.; Walle, R. V. D. (2009). Disseminating heritage records as linked open data. International Journal of Virtual Reality, 8, 39-44.
Coppens, S.; Verborgh, R.; Mannens, E.; Walle, R. V. D. (2013). Querying the Linked Data Graph using owl:sameAs Provenance. Proceedings of the 16th International Conference on Model Driven Engineering Languages and Systems.
Cyganiak, R.; Bizer, C. (2008). Pubby - A Linked Data Frontend for SPARQL Endpoints. http://wifo5-03. informatik.uni-mannheim.de/pubby
Doerr, M.; Gradmann, S.; Hennicke, S.; Isaac, A.; Meghini, C.; Van De Sompel, H. (2011). The Europeana data model (EDM). IFLA 2011: World library and information congress: 76th IFLA general conference and assembly. Gothenburg, Suecia: IFLA.
Domínguez-Velasco, S. (2013a). OWL2tips: herramienta para generar y visualizar ontologías desde Drupal. Beta 2 ed. Santa Clara, Cuba: Universidad Central de las Villas.
Domínguez-Velasco, S. (2013b). Sampras. Alfa 2 ed. Santa Clara, Cuba: Universidad Central de las Villas.
Domínguez-Velasco, S.; Leiva-Mederos, A. (2013). Jump. Beta ed. Santa Clara, Cuba: Universidad Central de las Villas.
Ferro, N.; Silvello, G. (2013). NESTOR: A formal model for digital archives. Information Processing and Management, vol. 49 (6), 1206-1240. http://dx.doi.org/10.1016/j.ipm.2013.05.001
Giasson, F.; D'arcus, B. (2009). Bibliographic Ontology Specification. Structured Dynamics LLC. http:// bibliontology.com/specification
Görlitz, O.; Staab, S. (2011). SPLENDID: SPARQL Endpoint Federation Exploiting VOID Descriptions. The 10th International Semantic Web Conference. http://iswc2011.semanticweb.org/fileadmin/ iswc/Papers/Workshops/COLD/GoerlitzAndStaab_ COLD2011.pdf
Gracy, Karen F. (2014). Archival description and linked data: a preliminary study of opportunities and implementation challenges. Arch Sci. Disponible en http://doiI10.1007/s10502-014-9216-2
Grimoüard, Claire Sibille-de (2014). The Thesaurus for French Local Archives and the Semantic Web. Procedia - Social and Behavioral Sciences, (147), 206–212. http://dx.doi.org/10.1016/j.sbspro.2014.07.153
Hartig, O.; Bizer, C.; Freytag, J. D. (2009). Executing SPARQL Queries over the Web of Linked Data. ISWC '09 Proceedings of the 8th International Semantic Web Conference. Springer-Verlag. http://dx.doi.org/10.1007/978-3-642-04930-9_19
Haslhofer, B.; Roochi, E. M.; Schandl, B.; Zander, S. (2011). Europeana RDF Store Report. Viena, Austria: Universidad de Viena.
Haslhofer, B.; Schnadl, B. (2008). The OAI2LOD Server: Exposing OAI-PMH Metadata as Linked Data. In: International Workshop on Linked Data on the Web (LDOW2008). Beijing, China.
Haslhofer, B.; Schnadl, B. (2010). Interweaving OAI-PMH data sources with the linked data cloud. International Journal of Metadata, Semantics and Ontologies, vol. 5 (1), 17-31. http://dx.doi.org/10.1504/IJMSO.2010.032648
Hernández, A. (2007). Organización y representación del conocimiento: paradigmas, hipertextos y fundamentación metamodélica. Universidad de La Habana. Cuba.
Hidalgo-Delgado, Y.; Rodríguez-Puente, R.; Ortiz- Muñoz, E.; Alonso-Sierra, L. E. (2013). Herramienta para la recolección de metadatos bibliográficos mediante el protocolo OAI-PMH. II Conferencia Internacional de Ciencias Computacionales e Informáticas. La Habana, Cuba.
Jentzsch, A.; Isele, R.; Bizer, C. (2010). Silk - Generating RDF Links while publishing or consuming Linked Data. International Semantic Web Conference Posters&Demos, Shanghai, China.
Koutsomitropoulos, D. A.; Solomou, G. D.; Theodore S. Papatheodorou (2008). Semantic Interoperability of Dublin Core Metadata in Digital Repositories. Innovations in Information Technology, 2008. IIT 2008. International Conference on IEEE. http://dx.doi.org/10.1109/INNOVATIONS.2008.4781709
Ledesma, G. A. (2013). Coroimagen. Ontología para el tratamiento de archivo. Santa Clara, Cuba: Universidad Central de las Villas.
Leiva-Mederos, A.; Senso, J. A.; Domínguez-Velasco, S.; Hípola, P. (2013). Authoris: a tool for authority control in the Semantic Web. Library Hi Tech, vol. 31 (3), 536-553. http://dx.doi.org/10.1108/LHT-12-20112-0135
Lynch, Tom J. (2014). Social Networks and Archival Context Project: A Case Study of Emerging Cyberinfrastructure. Digital Humanities Quarterly, 8(3). Disponible en: http://www.digitalhumanities.org/dhq/vol/8/3/000184/000184.html [Consulta: 1 de septiembre de 2015].
Mena, M. (2006). Retos de la actividad archivística: reporte de conferencias. La Habana, Cuba: Universidad de la Habana.
Merlino-Santesteban, C. (2012). Repositorios institucionales y buscadores web: una interrelación no tan exitosa. 10ª Jornada sobre la biblioteca digital universitaria. Buenos Aires. Argentina.
Moyano Collado, Julián. (2013). La Descripción Archivística. de los Instrumentos de Descripción Hacia la Web Semántica. Anales de Documentación, 16(2), 3–13.
Nam-Park, Ok. (2015). Development of Linked Data for Archives in Korea. DLib Magazine, 21(3-4), 1-13.
Nolle, A.; Nemirovski, G. (2013). ELITE: An Entailment-based Federated Query Engine for Complete and Transparent Semantic Data Integration. In Eiter, T.; Glimm, B.; Kazakov, Y.; Krötzsch, M. (eds.) (2013). 26th International Workshop on Description Logics. Ul, Alemania.
Peroni, S.; Shotton, D. (2012). FaBiO and CiTO: ontologies for describing bibliographic resources and citations. Web Semantics: Science, Services and Agents on the World Wide Web, vol. 17, 33-43. http://dx.doi.org/10.1016/j.websem.2012.08.001
Popitsch, N.; Haslhofer, B. (2011). DSNotify - A solution for event detection and link maintenance in dynamic datasets. Web Semantics: Science, Services and Agents on the World Wide Web, vol. 9 (3), 266-283. http://dx.doi.org/10.1016/j.websem.2011.05.002
Quilitz, B.; Leser, U. (2008). Querying Distributed RDF Data Sources with SPARQ. ESWC'08 Proceedings of the 5th European semantic web conference on The semantic web: research and applications. Springer- Verlag.
Rademaker, A.; Borges-Oliveira, D. A.; de Paiva, V.; Higuchi, S.; Medeiros e Sá, A.; Alvim, M. (2015). A linked open data architecture for the historical archives of the Getulio Vargas Foundation. Int J Digit Libr, (15), 153–167. http://dx.doi.org/10.1007/s00799-015-0147-1
Sánchez Alonso, S.; Sicilia Urbán, M. Á.; Rato Leguina, G. D. (2008). Sobre la interoperabilidad semántica en las descripciones archivísticas digitales. Revista Española de Documentación Científica, vol. 31 (1), 11-34.
Schöpfel, J.; Bescond, I.; Prost, H. (2012). Open is not enough: a case study on grey literature in an OAI environment. The Grey Journal, vol. 8 (2), pp. 112-124.
Schwarte, A.; Haase, P.; Hose, K.; Schenkel, R.; Schmidt, M. (2011). FedX: A Federation Layer for Distributed Query Processing on Linked Open Data. The Semanic Web: Research and Applications, 481-486. http://dx.doi.org/10.1007/978-3-642-21064-8_39
Sing-Borrajo, P. (2013). Reporte de cargas en los datasets de la biblioteca universitaria de la Universidad Central de las Villas. Santa Clara, Cuba: Universidad Central "Marta Abreu" de las Villas, Facultad de Ingeniería Eléctrica, Departamento de Telecomunicaciones.
Solomou, G.; Papatheodorou, T. (2010). The Use of SKOS Vocabularies in Digital Repositories: the DSpace case. Fourth International Conference on Semantic Computing. IEEE. http://dx.doi.org/10.1109/icsc.2010.83
Tummarello, G.; Delbru, R.; Oren, E. (2007). Sindice.com: Weaving the open linked data. 6th international The semantic web and 2nd Asian conference on Asian semantic web conference.
Vasallo, S. (2010). Descrizioni Archivistiche e web semántico: un connubio possibile. Italian Journal of Library and Information Science, 1(1), 169 – 163.
Published
How to Cite
Issue
Section
License
Copyright (c) 2016 Consejo Superior de Investigaciones Científicas (CSIC)

This work is licensed under a Creative Commons Attribution 4.0 International License.
© CSIC. Manuscripts published in both the print and online versions of this journal are the property of the Consejo Superior de Investigaciones Científicas, and quoting this source is a requirement for any partial or full reproduction.
All contents of this electronic edition, except where otherwise noted, are distributed under a Creative Commons Attribution 4.0 International (CC BY 4.0) licence. You may read the basic information and the legal text of the licence. The indication of the CC BY 4.0 licence must be expressly stated in this way when necessary.
Self-archiving in repositories, personal webpages or similar, of any version other than the final version of the work produced by the publisher, is not allowed.