Minimum Information about a Digital Specimen
DRAFT
Why use MIDS?
The need to rapidly digitise millions of specimens in natural history collections has led to the wide adoption of a staged approach to data capture. Mass digitisation programmes have generally started by creating skeletal or stub records, which can then be expanded as more funding or support becomes available. Combined with the earlier practice of creating relatively full data records for each specimen, this has produced huge variation in the level of digitisation both within and between collections.
The Minimum Information about a Digital Specimen (MIDS) specification has been designed to provide a means of measuring and monitoring the level of digitisation of specimens within a collection. It provides guidance for prioritising the data to be captured as well as recommendations for data standards and mapping structures.
Getting started links
- Normative Information Element & Levels List
- Mappings
- Provide feedback through a GitHub issue
How to use MIDS for digitisation planning
The MIDS standard sits at the heart of all aspects of the digitisation process and can be used to build a digitisation strategy and programme. A digitisation strategy normally includes sections covering the vision, the reasons for digitising, the intended users of the digitised specimens, the scope and prioritisation, the strategic objectives, and the metrics of success and impact (https://dissco.github.io/DigitisationPlanning/DigPlanning.html). Because each MIDS level defines a measurable minimum set of information, MIDS is particularly relevant to the metrics of success. The mapping of data within MIDS is a key part of building the data structure within a digitisation programme, including mapping to Darwin Core or ABCD(EFG) and publication on international aggregator portals such as GBIF, GeoCASe, MinDat, etc.
The recommendations included in the MIDS elements will support decisions about which standards to use, including those for identifiers. This data structure will also help with managing data in an institutional collections management system (CMS) and with ancillary processes such as citizen science portals.
The different levels of MIDS relate to a staged digitisation programme, which may require different funding, equipment and workflows for each stage of the process. Using MIDS will help to identify the costs of each stage of digitisation, since each level has a clearly defined minimum set of information. Each MIDS digitisation level has been defined to reflect a broad use case, helping to build a business case for funding. In addition, each information element within a MIDS level has a definition and a purpose, which support communication and help to encourage buy-in from colleagues or funders.
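As an illustration of how a "clearly defined minimum set of information" per level can be turned into a measurement, the following minimal Python sketch assigns a level to a single record. It rests on several assumptions that are not taken from the specification: the element names in `REQUIRED_BY_LEVEL` are hypothetical placeholders (the normative elements are in the Information Element & Levels List), levels are numbered 0 to 3, and a record only reaches a level if it also satisfies every lower level.

```python
from typing import Mapping

# Hypothetical element names, for illustration only. The normative
# elements for each level are defined in the MIDS Information Element
# & Levels List, not here.
REQUIRED_BY_LEVEL: dict[int, set[str]] = {
    0: {"specimen_id", "institution"},
    1: {"name", "object_type"},
    2: {"collecting_date", "collector"},
    3: {"coordinates"},
}


def mids_level(record: Mapping[str, object]) -> int:
    """Return the highest level whose elements, and those of all lower
    levels, are present in the record. Returns -1 if level 0 is not met."""
    # Treat None and empty strings as "not captured".
    present = {key for key, value in record.items() if value not in (None, "")}
    level = -1
    for lvl in sorted(REQUIRED_BY_LEVEL):
        if not REQUIRED_BY_LEVEL[lvl] <= present:
            break  # a missing element stops progression to higher levels
        level = lvl
    return level
```

For example, a stub record holding only the two placeholder level 0 fields would be reported at level 0, and adding the placeholder level 1 fields would raise it to level 1. In a real programme the element lists would be replaced by the normative ones and by the institution's own field mapping.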
How to use MIDS for prioritisation of digitisation
The information elements within each MIDS level have been selected through a process of discussion and deliberation over several years. These elements represent the data that have formed the basis of many large-scale digitisation programmes, and hence have previously been prioritised, or have been agreed as being of high importance at each stage of digitisation.
It was recognised that, at higher MIDS levels, there were sufficient differences in the data between Biology, Geology and Palaeontology to warrant a distinction between the information elements required at MIDS levels 2 and 3.
The MIDS level aimed at within a digitisation programme will depend on many factors, including funding and local priorities. The definition, purpose, recommendations and mapping provided by MIDS will enable an institution to determine which level of MIDS is to be selected as the goal for the collection.
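When weighing up which level to set as the goal, it can help to know how much of a collection already meets a candidate target. The sketch below is one possible way to summarise this in Python; it assumes that a level has already been assigned to each record by some other process, and the function name `level_summary` and its inputs are hypothetical.

```python
from collections import Counter
from typing import Iterable


def level_summary(levels: Iterable[int], target: int) -> tuple[dict[int, int], float]:
    """Count records per level and return the share at or above the target."""
    counts = Counter(levels)
    total = sum(counts.values())
    at_or_above = sum(n for lvl, n in counts.items() if lvl >= target)
    # Avoid division by zero for an empty collection.
    share = at_or_above / total if total else 0.0
    return dict(sorted(counts.items())), share


# Example with made-up levels for six records and a candidate target of 1.
counts, share = level_summary([0, 0, 1, 2, 3, 1], target=1)
print(counts, f"{share:.0%} of records at or above the target")
```

Running the same summary for each candidate target gives a rough picture of how far the collection is from each goal, which can then be set against funding and local priorities.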
It is important to keep in mind that MIDS defines the “Minimum” information at each level; institutions are strongly encouraged to capture more information where possible.
Last Updated: 6 February 2025