SeMRA Raw Semantic Mappings Database
The SeMRA Raw Semantic Mappings Database contains unprocessed semantic mappings
assembled from hundreds of ontologies and databases through pyobo.
Reproduction
The SeMRA Raw Semantic Mappings Database can be rebuilt with the following commands:
$ uv pip install semra
$ semra build
The semra build command downloads and process all resource and constructs
a database of unprocessed mappings.
Note
Downloading raw data resources can take on the order of hours to tens of hours depending on your internet connection and the reliability of the resources’ respective servers.
Processing and analysis can be run overnight on commodity hardware (e.g., a 2023 MacBook Pro with 36GB RAM).
Web Application
The SeMRA Raw Semantic Mappings Database can be downloaded from Zenodo at .
After downloading all files and unzipping then, a web application wrapping
the SeMRA Raw Semantic Mappings Database run locally on Docker with:
$ sh run_on_docker.sh
Navigate to http://localhost:8773 to see the web application.
Domain-specific Processed Mapping Databases
The domain-specific processed mapping databases and meta-landscape analysis can be reconstructed with the following commands:
$ uv pip install semra[landscape]
$ semra landscape
The semra landscape command runs all pre-configured domain-specific mapping
database construction, landscape analyses, and the meta-landscape analysis.
Note
Downloading raw data resources can take on the order of hours to tens of hours depending on your internet connection and the reliability of the resources’ respective servers.
Processing and analysis can be run overnight on commodity hardware (e.g., a 2023 MacBook Pro with 36GB RAM).
The results can be found here:
Domain |
Docs and Reproduction |
Database Download |
Analysis |
|---|---|---|---|
Disease |
|||
Cell and Cell Line |
|||
Anatomy |
|||
Protein Complex |
|||
Gene |
Disease
The SeMRA Disease Mappings Database assembles semantic mappings to the following resources:
Prefix |
Name |
|---|---|
Human Disease Ontology |
|
Mondo Disease Ontology |
|
Experimental Factor Ontology |
|
Medical Subject Headings |
|
NCI Thesaurus |
|
Orphanet |
|
Orphanet Rare Disease Ontology |
|
Unified Medical Language System Concept Unique Identifier |
|
Online Mendelian Inheritance in Man |
|
OMIM Phenotypic Series |
|
Genetic and Rare Diseases Information Center |
|
International Classification of Diseases, 10th Revision |
|
International Classification of Diseases, 10th Revision, Clinical Modification |
|
International Classification of Diseases, 10th Revision, Procedure Coding System |
|
International Classification of Diseases, 11th Revision (Foundation Component) |
|
ICD 11 Codes |
|
International Classification of Diseases, 9th Revision |
|
International Classification of Diseases, 9th Revision, Clinical Modification |
|
International Classification of Diseases for Oncology |
Results
The SeMRA Disease Mappings Database is available for download as SSSOM, JSON, and
in a format ready for loading into a Neo4j graph database
on Zenodo at .
A summary of the results can be viewed on the SeMRA GitHub repository in the landscape/disease folder.
Reproduction
The SeMRA Disease Mappings Database can be rebuilt with the following commands:
$ git clone https://github.com/biopragmatics/semra.git
$ cd semra
$ uv pip install .[landscape]
$ python -m semra.landscape.disease
Note
Downloading raw data resources can take on the order of hours to tens of hours depending on your internet connection and the reliability of the resources’ respective servers.
Processing and analysis can be run overnight on commodity hardware (e.g., a 2023 MacBook Pro with 36GB RAM).
Web Application
The pre-built artifacts for this mapping database can be downloaded from Zenodo
at and unzipped. The web application can be run
locally on Docker from inside the folder where the data was unzipped with:
$ sh run_on_docker.sh
If you reproduced the database yourself, you can cd
to the right folder and run with:
$ cd ~/.data/semra/case-studies/disease
$ sh run_on_docker.sh
Finally, navigate in your web browser to http://localhost:8773 to see the web application.
Variables
Configuration for the disease mappings database |
Cell and Cell Line
The SeMRA Cell and Cell Line Mappings Database assembles semantic mappings to the following resources:
Prefix |
Name |
|---|---|
Medical Subject Headings |
|
Experimental Factor Ontology |
|
Cellosaurus |
|
Cancer Cell Line Encyclopedia Cells |
|
DepMap Cell Lines |
|
BRENDA Tissue Ontology |
|
Cell Ontology |
|
Cell Line Ontology |
|
NCI Thesaurus |
|
Unified Medical Language System Concept Unique Identifier |
Results
The SeMRA Cell and Cell Line Mappings Database is available for download as SSSOM, JSON, and
in a format ready for loading into a Neo4j graph database
on Zenodo at .
A summary of the results can be viewed on the SeMRA GitHub repository in the landscape/cell folder.
Reproduction
The SeMRA Cell and Cell Line Mappings Database can be rebuilt with the following commands:
$ git clone https://github.com/biopragmatics/semra.git
$ cd semra
$ uv pip install .[landscape]
$ python -m semra.landscape.cell
Note
Downloading raw data resources can take on the order of hours to tens of hours depending on your internet connection and the reliability of the resources’ respective servers.
Processing and analysis can be run overnight on commodity hardware (e.g., a 2023 MacBook Pro with 36GB RAM).
Web Application
The pre-built artifacts for this mapping database can be downloaded from Zenodo
at and unzipped. The web application can be run
locally on Docker from inside the folder where the data was unzipped with:
$ sh run_on_docker.sh
If you reproduced the database yourself, you can cd
to the right folder and run with:
$ cd ~/.data/semra/case-studies/cell
$ sh run_on_docker.sh
Finally, navigate in your web browser to http://localhost:8773 to see the web application.
Variables
Configuration for the cell and cell type mappings database |
Protein Complex
The SeMRA Protein Complex Mappings Database assembles semantic mappings to the following resources:
Prefix |
Name |
|---|---|
Complex Portal |
|
FamPlex |
|
Gene Ontology |
|
ChEMBL target |
|
Wikidata |
|
Selventa Complexes |
|
Signaling Network Open Resource |
|
IntAct protein interaction database |
Results
The SeMRA Protein Complex Mappings Database is available for download as SSSOM, JSON, and
in a format ready for loading into a Neo4j graph database
on Zenodo at .
A summary of the results can be viewed on the SeMRA GitHub repository in the landscape/complex folder.
Reproduction
The SeMRA Protein Complex Mappings Database can be rebuilt with the following commands:
$ git clone https://github.com/biopragmatics/semra.git
$ cd semra
$ uv pip install .[landscape]
$ python -m semra.landscape.complex
Note
Downloading raw data resources can take on the order of hours to tens of hours depending on your internet connection and the reliability of the resources’ respective servers.
Processing and analysis can be run overnight on commodity hardware (e.g., a 2023 MacBook Pro with 36GB RAM).
Web Application
The pre-built artifacts for this mapping database can be downloaded from Zenodo
at and unzipped. The web application can be run
locally on Docker from inside the folder where the data was unzipped with:
$ sh run_on_docker.sh
If you reproduced the database yourself, you can cd
to the right folder and run with:
$ cd ~/.data/semra/case-studies/complex
$ sh run_on_docker.sh
Finally, navigate in your web browser to http://localhost:8773 to see the web application.
Variables
Configuration for the protein complex mappings database |
Gene
The SeMRA Gene Mappings Database assembles semantic mappings to the following resources:
Prefix |
Name |
|---|---|
NCBI Gene |
|
HUGO Gene Nomenclature Committee |
|
Mouse Genome Informatics |
|
Rat Genome Database |
|
Chicken Gene Nomenclature Consortium |
|
WormBase |
|
FlyBase Gene |
|
Saccharomyces Genome Database |
|
Online Mendelian Inheritance in Man |
|
CIViC gene |
|
Unified Medical Language System Concept Unique Identifier |
|
NCI Thesaurus |
|
Wikidata |
Results
The SeMRA Gene Mappings Database is available for download as SSSOM, JSON, and
in a format ready for loading into a Neo4j graph database
on Zenodo at .
A summary of the results can be viewed on the SeMRA GitHub repository in the landscape/gene folder.
Reproduction
The SeMRA Gene Mappings Database can be rebuilt with the following commands:
$ git clone https://github.com/biopragmatics/semra.git
$ cd semra
$ uv pip install .[landscape]
$ python -m semra.landscape.gene
Note
Downloading raw data resources can take on the order of hours to tens of hours depending on your internet connection and the reliability of the resources’ respective servers.
Processing and analysis can be run overnight on commodity hardware (e.g., a 2023 MacBook Pro with 36GB RAM).
Web Application
The pre-built artifacts for this mapping database can be downloaded from Zenodo
at and unzipped. The web application can be run
locally on Docker from inside the folder where the data was unzipped with:
$ sh run_on_docker.sh
If you reproduced the database yourself, you can cd
to the right folder and run with:
$ cd ~/.data/semra/case-studies/gene
$ sh run_on_docker.sh
Finally, navigate in your web browser to http://localhost:8773 to see the web application.
Variables
Configuration for the gene mappings database |
Anatomy
The SeMRA Anatomy Mappings Database assembles semantic mappings to the following resources:
Prefix |
Name |
|---|---|
Uber Anatomy Ontology |
|
Medical Subject Headings |
|
BRENDA Tissue Ontology |
|
Common Anatomy Reference Ontology |
|
NCI Thesaurus |
|
Unified Medical Language System Concept Unique Identifier |
Results
The SeMRA Anatomy Mappings Database is available for download as SSSOM, JSON, and
in a format ready for loading into a Neo4j graph database
on Zenodo at .
A summary of the results can be viewed on the SeMRA GitHub repository in the landscape/anatomy folder.
Reproduction
The SeMRA Anatomy Mappings Database can be rebuilt with the following commands:
$ git clone https://github.com/biopragmatics/semra.git
$ cd semra
$ uv pip install .[landscape]
$ python -m semra.landscape.anatomy
Note
Downloading raw data resources can take on the order of hours to tens of hours depending on your internet connection and the reliability of the resources’ respective servers.
Processing and analysis can be run overnight on commodity hardware (e.g., a 2023 MacBook Pro with 36GB RAM).
Web Application
The pre-built artifacts for this mapping database can be downloaded from Zenodo
at and unzipped. The web application can be run
locally on Docker from inside the folder where the data was unzipped with:
$ sh run_on_docker.sh
If you reproduced the database yourself, you can cd
to the right folder and run with:
$ cd ~/.data/semra/case-studies/anatomy
$ sh run_on_docker.sh
Finally, navigate in your web browser to http://localhost:8773 to see the web application.
Variables
Configuration for the anatomy mappings database |