CDK CxSMILES
Egon L. Willighagen, Adriano Rutz
© 2020-2024 The Authors
License: CC-BY 4.0 International
Origin
At the Dagstuhl Computation Metabolomics meetings in
2020 [1] and
2022 [2]
there were sessions about core structures, structure identity certainty, and identification.
Acknowledgments
We thank everyone who contributed to the discussions at the two Computation Metabolomics meetings.
In 2020 this was Marcus Ludwig, Steffen Neumann, Michael Witting, Marcus Ludwig, Corey Broeckling,
Laurent Gatto, Xiuxia Du, Alexis Delabriere, Jan Lisec, and Andrea Brunner, and in 2022 this was
Soha Hassoun, Roman Bushuiev, Louis-Felix Nothias, Sebastian Böcker, Justin van der Hooft, and
Adriano Rutz. Also thanks to Emma Schymanski, John Mayfield, and Carolin Huber for online replies.
Contents
- Introduction
1.1. CxSMILES in Wikidata
1.2. CDKDepict Gadget
- CxSMILES in the CDK
2.1. Parsing CxSMILES
2.1.1. Groovy
2.1.2. Python
2.1.3. CDKDepict
2.2. The internal graph model
2.3. Writing as an SD file
- Classes of compounds and where to find a CxSMILES
3.1. R-groups
3.2. Lipids
3.2.1. Lipids with a double bond somewhere in the tail
3.2.2. Lipids with two double bonds somewhere in the tail
3.2.3. Lipids with two tails
3.3. Positional uncertainty
3.3.1. Flavonoid
3.4. Polymers
3.4.1. Co-polymers
- Creating CxSMILES
4.1. Step 1: the core structure
4.2. Step 2: the side chains
4.3. Step 3: the attachment points
- Depiction of CxSMILES
5.1. Groovy code
- Liebisch abbreviations
6.1. CXSMILES
6.1.1. Some examples
6.2. Structure enumeration
- Enumerating chemical structures for a CxSMILES
7.1. Filtering by molecular formula
Index
References
- Ludwig M, Neumann S, Willighagen E. Cheminformatics for Users. In: Computational Metabolomics: From Cheminformatics to Machine Learning (Dagstuhl Seminar 20051). 2020.
- Broeckling C, Ebbels T, Mathé E, Zamboni N, Wieder C. Computational Metabolomics: From Spectra to Knowledge (Dagstuhl Seminar 22181) [Internet]. Vol. 12, Dagstuhl reports. 2022. 16 p. Available from: https://drops.dagstuhl.de/opus/volltexte/2022/17441/pdf/dagrep_v012_i005_p001_22181.pdf doi:10.4230/DAGREP.12.5.1 (Scholia)