Word cloud inspires a new way to visualize chemical structures

Drug discovery projects often require analysis and visualization of large number of chemical structures to identify new drug candidates. Growth of chemical databases (Ref. 1) has made millions of chemical compounds available at chemists's fingertips; however, visualizing so many structures in one go is a major challenge. Usually one would scroll through a simple 'list view' or a 'grid view' (see figure 1) of chemical structures, but that would be a time consuming process for a large number of molecules.


Figure 1 : Illustration showing grid view of chemical structures 

In order to address these issues, recently Ertl and Rohde proposed 'molecule cloud' approach to visualize a large set of molecules (Ref. 2). This method was inspired by word cloud (see figure 2) visualization technique often used to depict key words in textual content.


Figure 2 : Illustration showing a word cloud of chemical names

Using a simple word cloud (see figure 2) approach to visualize large number of chemical structures would lead to a messy pile of chemical structures (figure 3), so Ertl et al. decided to take a smarter approach. Since displaying individual molecule as a whole was difficult for a large number of chemical structures, they decided to display common substructures in the molecules in a layout which minimizes overlap between structure images. The size of each substructure image was scaled to the number of molecules containing that substructure in the chemical structure collection i.e. largest image corresponds to most frequent substructure.


Figure 3 : Illustration showing a pile of chemical structures in a random layout

Ertl et al. showcased the usefulness of their methods by rendering molecule clouds for chemical structures in PubChem database, ZINC database and CheMBL database. They also showed how to use color coding in molecule clouds to display biological activity or protein target classes.

Future Applications of Molecule Clouds

Interactive molecule clouds could be an useful starting point for finding chemistry articles with particular chemical structures in journal websites or in bibliographic databases. Molecule clouds could be an attractive way to interact with large number of chemical structures on mobile devices. It could also be integrated in chemical structure databases as an interactive visualization option apart from 'list view' and 'grid view'. 

All the details about molecule cloud visualization of chemical structures can be found here.  

References
--------------------------------
1. Chemistry's web of data expands     
    Richard Van Noorden
    Nature 2012
    http://www.nature.com/news/chemistry-s-web-of-data-expands-1.10328?nc=1341529052776

2. The Molecule Cloud - compact visualization of large collections of molecules
    Peter Ertl and Bernhard Rhode
    J Cheminformatics 2012 
    http://www.jcheminf.com/content/4/1/12 

Old NID
91601

Latest reads

Article teaser image
Donald Trump does not have the power to rescind either constitutional amendments or federal laws by mere executive order, no matter how strongly he might wish otherwise. No president of the United…
Article teaser image
The Biden administration recently issued a new report showing causal links between alcohol and cancer, and it's about time. The link has been long-known, but alcohol carcinogenic properties have been…
Article teaser image
In British Iron Age society, land was inherited through the female line and husbands moved to live with the wife’s community. Strong women like Margaret Thatcher resulted.That was inferred due to DNA…