9.9.1. Data page markup with Bioschemas¶
9.9.1.1. Main Objectives¶
The main purpose of this recipe is:
To markup a webpage representing a data record with
Bioschemas
compliant markup.
9.9.1.2. Graphical Overview¶
9.9.1.3. Capability & Maturity Table¶
Capability |
Initial Maturity Level |
Final Maturity Level |
---|---|---|
Findability |
minimal |
repeatable |
Interoperability |
minimal |
9.9.1.4. Method¶
We will outline the steps for embedding machine processable mark up within a web page in your site. As a specific example, we will use the Wikidata page for BRCA1.
Identify a data page in your site for which you will develop markup
Open the Bioschemas Generator
Select the type of data page that you are marking up, i.e. the primary subject of the page. In the example we will use
Gene
. Here are the Bioschemas profiles that you can choose from1.ChemicalSubstance
for a page about a chemical substance composed of molecular entitiesGene
for a page about a geneMolecularEntity
for a page about a single molecular entityProtein
for a page about a proteinBioSample
for a page about a biological sampleTaxon
for a page about a taxon
Enter the URL of the page in URL box, e.g.
https://www.wikidata.org/wiki/Q227339
. Note that this URL will be used as the identifier for the resource being described in the markup.Click on the
Show Form
button
Complete the profile form with the data relevant for your page. Once completed, click on the
Generate Markup
buttonYou should complete all Minimum properties and as many Recommended properties as possible. You can show/hide properties using the
Additional Properties
buttons.The form defaults to the data type with the first alphabetical character, e.g. for
identifier
this defaults toPropertyValue
butText
orURL
will be more appropriate in most casesThe right side of the screen gives examples for properties, where these have been provided by the Bioschemas profile authors. Click on the
Show
button to see the example for a specific property. Click onMinimum
,Recommended
, orOptional
to expand/contract the section and see the properties contained at that marginality level
You should now see the generated markup in
JSON-LD
format. You can click on theMicrodata
andRDFa
tabs to see the same content rendered in the different formats. However, we recommend the use ofJSON-LD
. For our Wikidata example, we get the following markup
<script type="application/ld+json" >
{
"@context": "https://schema.org",
"@id": "https://www.wikidata.org/wiki/Q227339",
"@type": "Gene",
"alternateName": [
"breast cancer 1, early onset",
"BRCAI",
"BRCC1",
"BROVCA1",
"IRIS",
"PNCA4",
"PPP1R53",
"PSCP",
"RNF53",
"FANCS",
"breast cancer 1",
"BRCA1, DNA repair associated",
"BRCA1 DNA repair associated"
],
"dct:conformsTo": "https://bioschemas.org/profiles/Gene/0.7-RELEASE",
"description": "protein-coding gene in the species Homo sapiens",
"encodesBioChemEntity": {
"@type": "BioChemEntity",
"@id": "https://www.wikidata.org/wiki/Q17487737"
},
"identifier": "https://www.wikidata.org/wiki/Q227339",
"identifier": "Q227339",
"image": {
"@type": "ImageObject",
"@id": "https://upload.wikimedia.org/wikipedia/commons/thumb/e/e1/Protein_BRCA1_PDB_1jm7.png/220px-Protein_BRCA1_PDB_1jm7.png"
},
"isPartOfBioChemEntity": {
"@type": "BioChemEntity",
"@id": "https://www.wikidata.org/wiki/Q220677"
},
"name": "BRCA1",
"sameAs": [
"https://meshb.nlm.nih.gov/#/record/ui?ui=D019398",
"https://www.ncbi.nlm.nih.gov/nuccore/NR_027676",
"http://identifiers.org/ensembl/ENSG00000012048",
"https://www.ncbi.nlm.nih.gov/gene/672",
"http://identifiers.org/ncbigene/672"
],
"taxonomicRange": {
"@type": "Taxon",
"@id": "https://www.wikidata.org/wiki/Q15978631"
},
"url": "https://www.wikidata.org/wiki/Q227339"
}
</script >
Download or copy and paste the generated markup
Make adjustments for any bits that could not be properly entered through the form.
For example, for our generated markup we would change
"encodesBioChemEntity": {
"@type": "BioChemEntity",
"@id": "https://www.wikidata.org/wiki/Q17487737"
},
to
"encodesBioChemEntity": {
"@type": "Protein",
"@id": "https://www.wikidata.org/wiki/Q17487737"
},
You can test that your JSON-LD is valid syntax, and visualise your markup using the JSON-LD Playground.
Once you are happy with your markup, include the
JSON-LD
, script tags and all, at the bottom of your HTML page template. Make sure that this is before the closing</html>
tagReplace the values in your markup with variables that your web page templating system will replace with values from your database. For example, the follow snippet uses variables of the form
%%%PAGEURL%%%
<script type="application/ld+json" >
{
"@context": "https://schema.org",
"@id": "%%%PAGEURL%%%",
"@type": "Gene",
"dct:conformsTo": "https://bioschemas.org/profiles/Gene/0.7-RELEASE",
"identifier": "%%%PAGEURL%%%",
"description": "%%%DESCRIPTION%%%",
...
}
Your site should now generate data pages with embedded markup.
You should complete this process for each different type of data that you include in your site, e.g. ChEMBL would have MolecularEntity
markup on their chemical pages and Protein
markup on their target pages so would need to complete this process twice.
Once you have deployed this on your web server, you can test it with the Bioschemas Validator which scrapes the markup from your page and allows you to test it against various Bioschemas profiles2.
9.9.1.5. FAIRification Objectives, Inputs and Outputs¶
Actions.Objectives.Tasks |
Input |
Output |
---|---|---|
9.9.1.6. Table of Data Standards¶
Data Formats |
Terminologies |
Models |
---|---|---|
9.9.1.7. References¶
References
1: If you do not see your data type listed, tick the Include Draft Profiles
to see if there is a draft profile for your data type. Otherwise contact the Bioschemas community to suggest development of a new profile.
2: The Bioschemas Validator is currently in an early alpha release and does not include all the profiles.
9.9.1.8. Authors¶
Authors
Name |
ORCID |
Affiliation |
Type |
ELIXIR Node |
Contribution |
---|---|---|---|---|---|
Heriot Watt University |
Writing - Original Draft |
||||
ZB MED Information Centre for life sciences |
Writing - Review & Editing |
||||
University of Oxford |
Writing - Review & Editing |