Tools: Services and connectors for access and exploitation The ELIXIR Tools Platform improves the discovery, quality, and sustainability of bioinformatics software. The main objective is to (1) allow researchers to find, compare, access and deploy software resources and analytical workflows, (2) help software developers to describe, develop and monitor their products, and (3) help establish and host continuous benchmarking efforts across life science domains.
Task 2: Performance benchmarking & technical monitoring
Major achievements in 2019
The team also reached an agreement to make publicly available the results from the DREAM Challenges3 to facilitate their use, reuse, and further exploration. This set the foundations to host future DREAM challenges directly in OpenEBench. To make possible such collaboration, new visualization mechanisms were generated leading to the establishment of the plots visualization gallery.
The focus of the Platform in 2019 was to build on the progress made through the Implementation Study on Biocontainers. The Platform incorporated new technologies for software containerisation into the Biocontainers registry, and explored the federation of the platform to facilitate its sustainability in the long-term. The team also started to plan the implementation of the new ecosystem for the Tools Platform, where components of that ecosystem (bio.tools, Biocontainers, OpenEBench, Galaxy and others) will contribute their content into a central repository to facilitate data and metadata integration, homogenization, and curation. Both Bio.tools and OpenEbench groups have begun to engage with the different ELIXIR Communities to capture their specific requirements around tools registration and benchmarking. Finally, the best practices group started to examine FAIR principles around research software and continue to promote the four simple recommendations for open source software (4OSS)1.
Task 1: Packaging, containerisation & deployment The new BioContainers API follows the GA4GH specification for tool registries that enables the deposition of bioinformatics containers into public repositories and their execution in public cloud environments. During the Biocontainer Implementation Study, more than 1,000 bioinformatics resources were added to BioContainers, increasing the number of tool containers to 8,500 bioinformatics tools and 75,000 containers2.
The Scientific Benchmarking data model was released, based on the feedback provided by different scientific communities. The OpenEBench team deployed the scientific benchmarking front-end, improving the user experience in accessing the benchmarking data.
OpenEBench has also initially developed a set of metrics based on the FAIR principles applied to research software to understand the challenges in automatically measuring different aspects of software quality. This effort will contribute to identifying general trends about how bioinformatics software is produced, maintained and used by the scientific community.
Task 3: Registry of bioinformatics tools metadata and identifiers The bio.tools team engaged with the proteomics community to curate additional 189 tools. A total of 751 tools within the proteomics domain are now accessible via bio.tools.4 The fast content growth in bio.tools, reaching close to 15,000 entries by the end of 2019, was driven by the usage of automatic tool annotation pipeline Pub2Tools5. Pub2Tools finds relevant recent publications that describe bioinformatics tools, annotates them using EDAM terms and creates bio.tools entries. Several smaller tools were developed in collaboration between ELIXIR Nodes in Estonia, Norway, France and Germany to enhance EDAM usage by displaying and selecting EDAM term information6. Interoperability between the major Tools Platform components was increased by finding DOI matches between Galaxy Tools, bio.tools and Debian Med tools and adding a frontend to render an interactive report on the obtained data7. Finally the team has Implemented badges for tool cards that highlight ELIXIR resource collections (ELIXIR Core Data Resources, ELIXIR Deposition Databases, ELIXIR Communities).
1 JimĂŠnez RC, Kuzak M, Alhamdoosh M et al. Four simple recommendations to encourage best practices in research software. F1000Research 2017, 6(ELIXIR):876 (http://doi.org/10.12688/f1000research.11407.1) 2 https://biocontainers.pro 3 http://dreamchallenges.org 4 Vasileios T, Ienasescu H-I, Gabrielaitis D et al. One Thousand and One Software for Proteomics: Tales of the Toolmakers of Science, Journal of Proteome Research 2019, 18, pages 3580-3585 (http://doi.org/10.1021/acs.jproteome.9b00219) 5 https://pub2tools.readthedocs.io/en/latest 6 https://github.com/inkuzmin/edam-select 7 https://github.com/bio-tools-community/galaxy-biotools-deb-toolinfo
22
ELIXIR Annual Report 2019