metabolomics-practical

Metabolite Identification

prev toc next

Mass spectrometry

Mass spectrometry is an analytical methods that uses the mass-to-charge ratio to characterize compounds. During a mass spectral analysis it is common that the compound breaks down in smaller fragments, resulting in multiple peaks. The masses of the fragments or compound specific. But even then, with just a mass spectrum it is not easy to identify a compound.

  1. why will we not see lysine (C₆H₁₄N₂O₂) in a mass spectrum?

Monoisotopic masses

The monoisotopic mass is the mass of a compound with all atoms having a single, major isotopic mass for each element. For example, the monoisotopic mass of hydroxyurea is 76.02728.

  1. how many compound in ChEBI have this monoisotopic mass? (use the advanced search functionality)

You can experiment with the range in the advanced search. Making it slightly larger increases the number of search hits. Try searching for the range 76.0 to 76.5.

Molecular formula

Lysine has the molecular formula C₆H₁₄N₂O₂. If the mass spectrum gives us a [M+H]+ peak matching it molecular formula C₆H₁₅N₂O₂+ then it could be lysine. But with only the chemical formula, we do not know how the atoms are bound to each other. That is, we do not know what the correct stereoisomer is. In fact, we do not even know what the correct constitutional isomer is.

  1. how many compound entries does ChEBI return if you search for the chemical formula C₆H₁₄N₂O₂? (use the advanced search functionality)

A chemical structure generator

Given a chemical formula, say C₂H₆O, you can figure out how many chemical structures have that formula.

  1. how many unique chemical structures have the C₂H₆O?

But this number goes up quickly. Toluene has many other chemical structures with the same chemical formula. A tool called Surge can efficiently enumerate them. Not all structures it generates are chemically stable, but many are. PubChem gives you some idea, but this includes isotopic variants. Not part of this practical, but if you try Surge, use the -S option to output SMILES, which you can render as 2D structures with CDK Depict for up to 500 SMILES strings in one go.

  1. Can you draw all 14 chemical structures with the chemical formula C₅H₁₂O?

prev toc next

Copyright 2020-2024 (C) Egon Willighagen - CC-BY Int. 4.0