Our main goal was to design and build a taxonomy of medieval Portuguese history using an interdisciplinary approach based on doctoral research. First, the criteria used for the selection of the vocabulary and its formal and semantic normalization were determined. Then species were listed, followed by the characterization of categories and their respective subclasses. As conclusions we highlight the successful application of the selected terms, as well as the fact that the taxonomy’s categories are being continuously updated and expanded, both in their global extension and in the depth of their thematic representation. In addition we offer proposals for continuing the ontological development of this taxonomy.
Se ha fijado como objetivo principal diseñar y elaborar una taxonomía sobre Historia medieval portuguesa, y hacerlo mediante un abordaje interdisciplinar y como fruto de las investigaciones de tesis. En su proceso se determinaron, primero, los criterios seguidos para la selección y normalización formal y semántica del vocabulario. Luego se listaron las especies y se caracterizaron las categorías y las respectivas subclases. Como conclusión se destaca el hecho de que se han experimentado con éxito métodos antes ya probados en otros estudios para proyectar, desarrollar y mantener taxonomías, con independencia del nivel de especificidad temática, hecho que plantea otro tipo de retos. Sin olvidarse de avanzar propuestas que le den continuidad hacia su desarrollo ontológico.
This note describes a taxonomy aimed at organizing and representing information related to Portuguese Medieval History between the 12th and the 15th centuries, which is currently available online as a structured list. This is an applied work (Medeiros,
The construction of a taxonomy within the scope of Medieval History is, as we shall see, a pioneering choice and, as such, there are no parameters available for comparison. It should be understood as an instrument that undergoes a continuous construction and evaluation process, but whose purposes are clear.
The construction of this taxonomy seeks to fill a gap found in the Portuguese information units specialized in Medieval History, many of which are integrated into universities and their research centres. Moreover, the construction of this controlled vocabulary seeks mainly to address the need for the indexation of digital resources, namely the ones contained in specialized databases, which are currently seen as crucial vehicles for disseminating and sharing open-access scientific knowledge. We can mention, as an example, a series of databases that have already been developed by the Institute for Medieval Studies of the Faculdade de Ciências Sociais e Humanas of the Universidade Nova de Lisboa (IEM - FCSH/UNL), available at:
In order to build and develop this taxonomy we used a top-down, or general-to-specific, method (Conway and Sligar,
Compiling the knowledge;
Reducing the number of synonyms and choosing the preferential terms;
Preparing a preliminary systematic structure;
Developing the first draft of the scheme;
Completing the taxonomy with all its terms;
Evaluating its performance;
Publishing its first version.
Next, we detail the specific development of each of the stages in the construction of the Portuguese Medieval History taxonomy, following the top-down method.
It should be noted here that since the source language of the taxonomy is Portuguese, we naturally followed the guidelines provided by national standards, namely the 4036 Portuguese Standard (items 6 and 7) and the
The morphological and syntactic control has not proved particularly complex. The Portuguese language rules were followed, as well as the national standards. The same did not occur with the semantic control of some terms, as is the case of the word “Jantar” and the term “Cantigas de escárnio e de maldizer”. The first is an example of multiple meanings (semantic ambiguities via significant pathway) and the second, a case of synonymity (semantic ambiguities via meaning).
As we can see in
The distribution of the terms across the hierarchical structure was made alphabetically, except in the «Events», «Personalities» and «Reigns» categories, in which the terms were introduced chronologically. This was the only way to organize them diachronically, allowing the user to browse the taxonomic structure according to the natural sequence of events, personalities and reigns, respectively:
Ex. 1128, Batalha de São Mamede
Ex. 1350-1405, Leonor Teles
Ex. 1385-1433, Reinado de D. João I
Next, in a fourth stage, we prepared the first version of the taxonomic scheme, defining the hierarchical position of the terms within the respective categories and, consequently, ordering each specific term under the corresponding generic term. Subsequently, the terms were divided into two types of classes within each category: chains and rows. The former are vertical series of concepts and the latter are horizontal series of concepts; each series may be generic (type?) or partitive (part? whole?) (Campos e Gomes,
These procedures allowed us to move on to the fifth stage: preparing the classification superstructure of the taxonomy and filling it with all the terms. The final version includes the following 17 categories (see
1) Administration
2) Fields of History
3) Historiographical sciences and techniques
4) Crown
5) Chronology
6) Culture
7) Economy
8) Social structure
9) Historical sources
10) Geography
11) War
12) Personalities
13) Information resources
14) Reigns
15) Religion
16) Society
17) Private life
In a sixth stage, we evaluated the taxonomy according to the recommendations of the ANSI/NISO Z39.19-2005 standard, because both the NP 4036 and the ISO 25964-2011 standards fail to mention this issue. So, we chose to conduct a heuristic evaluation of the final version of the taxonomy, since we believe that this is the most appropriate approach to the controlled vocabulary we have built; it is a domain-based taxonomy and, as such, it implies a real and permanent intervention and monitoring of the construction process by experts, which is what actually happened (ANSI/NISO Z39.19,
Therefore, we asked for the formal collaboration of two experts, one from the field of Portuguese Medieval History, and the other one from the field of Library and Information Science (LIS) –specialized in knowledge organization systems– whose evaluations we present further ahead.
In the seventh and last stage, we published the first version of the taxonomy. All the terms that had been compiled and normalized were fed into a specific taxonomy management software called Knowledge Manager (KM), marketed by the Spanish company
Take, as an example, the search term “Demanda do Santo Graal”.
In
Currently, the taxonomy includes 2987 terms which are constantly being reviewed and updated.
The full version of the taxonomy is available online at:
As shown in
So, «Chronology» is the largest category, with 570 terms, while «Historiographical sciences and techniques» has just 9 terms.
The categories «Personalities», «Economy», «War», «Culture» and «Society» also stand out with 523, 486, 265, 244 and 231 terms, respectively.
On an intermediate level we list the categories «Administration», «Social structure», «Religion» and «Historical sources», which have, correspondingly, 142, 132, 114 and 93 terms.
Finally, and with a smaller number of terms, we identify the categories «Geography», «Crown», «Private life», «Information resources», «Fields of History», «Reigns» and «Historiographical sciences and techniques», which have 47, 40, 38, 22, 17, 14 and 9 terms, respectively.
We have to consider the diverse sizes and depths of categories. This is due mainly to the different scopes of these specific categories. Other factors are the uneven distribution of specific sources for each category, and the variable extent to which different topics of Medieval History are represented in the scientific knowledge under evaluation - that which is developed within Portuguese universities.
With regard to the evaluation of the taxonomy, generally speaking, we asked our medievalist to conduct a semantic evaluation (regarding timeliness and communicability of the information) in addition, we asked our expert on representation of information to conduct a formal evaluation (on the morphological and syntactic aspects). For that purpose, we prepared two analysis grids, following the recommendations of the ANSI/NISO Z39.19-2005 standard. As expected in the case of interdisciplinary research works, the results of both evaluations were very positive and complemented each other; this circumstance is essential for the work under study, given the need for a joint terminological validation.
By way of conclusion, we make some essential considerations:
First, it should be noted that the taxonomy we constructed, when applied to research, successfully tested the methods previously tried by experts for designing, developing and maintaining controlled vocabularies of this nature.
We should also note that, although this taxonomy is representative of one country at one particular historical moment, it can still function as a starting point for building other controlled vocabularies with the same thematic scope, referring to different geographical locations, in the same time frame.
Finally, we stress the pioneering nature of this taxonomy within the panorama of Portuguese Medieval Studies. Precisely due to the absence of comparative structures, both at the national and at the international levels, its construction was a great challenge that we were only able to meet thanks to an interdisciplinary collaboration between medievalists and I&D professionals. That is why we would like to stress the idea that the taxonomy we have built is, above all, a first attempt at defining a terminological categorization for this subject matter. As such, it is open for discussion and it undergoes a permanent formal and semantic evolution process, as the terms of the taxonomy are applied to existing specialized databases, on an experimental basis for the time being.
With regard to future developments, we consider that the taxonomy can integrate various materials from various digital resources (databases, bibliographies and other specialized compilations) into a single format (xml) and classify them all using the same taxonomy.
We would also like to emphasize that the reference taxonomy is still under construction because, even though the semantic structure is finished, we still have to develop a way to determine names through labels and schematic links which favours its reuse (Daconta et al,
We would like to thank the medievalists and the I&D professionals for their collaboration in the semantic and formal validation of the taxonomy, respectively.
This article was also made possible thanks to the collaboration of the Spanish company
Note: Since it is impossible to include all the bibliographic references that were used as sources of information for the selection of the terms included in the taxonomy, we decided to mention only general histories and reference university textbooks from the field of Portuguese Medieval Studies.