Finding Structures in the Unstructured
The new Cytoscape 3 is based on OSGI, which I barely know anything about. Even though Cytoscape 3 is still in beta, manuals, wikies and tutorials have been flowing around in the web, which could be overwhelming if you want to create a new C3 compatible bundle app. Today I tried to set up an Cytoscape 3 bundle App (aka Plugin in the Cytoscape 2.x world) development environment in Eclipse. To my surprise, it is without any hassle by following a tutorial and a wiki page.
First, by following this tutorial: Create a Bundle App Using IDE , it’s really straightforward to create the app with Maven and Eclipse (M2Eclipse is required). After everything is set up, I made sure that the app was installed by running “mvn install” (either in the app directory from command line or within eclipse) and checked my local maven repository ~/.m2/repository/groupid/architectid (yes I am using Linux).
Then I followed this wiki page: Interactive Shell , and deployed the app and tested it.
First download cytoscape 3:
> wget http://chianti.ucsd.edu/cytoscape-3.0.0-M4/cytoscape-unix-3.0.0-M4.tar.gz > tar xzf cytoscape-unix-3.0.0-M4.tar.gz > cd cytoscape-unix-3.0.0-M4 > ./cytoscape.sh # this will start cytoscape shell and the GUI Cytoscape 3.0.0-M4> install mvn:groupid/artifactid Bundle ID: 162 Cytoscape 3.0.0-M4> start 162
Now go to the Cytoscape GUI, and open Apps menu, and there it is: “Hello World!”
Iorio, F. et al published a paper in the PNAS journal. They built a drug network out of gene expression data from small molecule screens. Using network analysis tools, they were able to group drugs into communities using network analysis. They claim that the compounds in the same community tend to have similar MoA, or act on similar biological pathways. They correctly predicted nine anticancer compounds and discovered an unreported effect for an existing drug.
There are several interesting things in their work. First, they developed a new algorithm to merge the gene expression data from different cell lines given the same compound. This aggregation algorithm uses several techniques such as Spearman’s footrule, Borda Merging Method and the Kruskal Algorithm.
Secondly, they calculated the drug similarity based on the merged list of gene ranks from the previous step based on results from Gene Set Enrichment Analysis (GSEA). This is because the list of expressions are different for two different compounds.
Thirdly, they compared the drug similarity measure with chemical similarity measures using fingerprints or electrotopological descriptors. The results are quite uncorralted with very low pearson correlation coefficient. An interesting extension of this could be the activity cliffs – very similar compounds that having completely different biological activity profile.
One limitation of this method is that you need to have a gene expression profile of a compound to be able to use it, similar to the original connectivity map approach. The other concern is that for compounds that have inconsistent effects on different cell lines. But as the authors mentioned, “when no information on the drug MoA is available a priori, the best strategy is still
to merge profiles from multiple cell lines”.
Iorio, F. et al. Discovery of drug mode of action and drug repositioning from transcriptional responses. Proceedings of the National Academy of Sciences 107, 14621 -14626 (2010).
Cephalosporins disrupts the synthesis of the peptidoglycan layer of bacterial cell walls. The petidoglycan layer is important for cell wall structural integrity. The final transpeptidation step in the synthesis of the peptidoglycan is facilitated by transpeptidases known as penicillin-binding proteins (PBPs). PBPs bind to the D-Ala-D-Ala at the end of muropeptides (peptidoglycan precursors) to crosslink the peptidoglycan. Beta-lactam antibiotics mimic this site and competitively inhibit PBP crosslinking of peptidoglycan.
This week’s molecule is BPA, or bisphenol-A. New York time has a long article about the research over this controversial chemical, mended with some political background. Details at In Feast of Data on BPA Plastic, No Final Answer – NYTimes.com
From the article, there is no decisive results to testify BPA’s effects on human bodies. But it doesn’t mean there is no results. Rather, scientists are still debating over whether BPA is toxic or not, mainly because unreproducible results. Part of the reasons of the conflicting results, as the article pointed out, is that “different laboratories have studied the chemical in different ways”. The conflicting results have even produced conflicts between scientists. It’s interesting to read that “researchers who study it argue bitterly at conferences”, and “At one such meeting, scientists in the audience said, “We don’t want to hear you two speak until you get this straightened out,” “.
The articles goes on with recent development in the research, and mentioned some efforts in reconciling the disputes. One biggest dispute is that low dosage of BPA turns out to have worse effects in animal studies than higher dosages. From endocrinologists’ point of view, it makes sense because BPA is no regular toxicant, but it “acts like a hormone”, and “hormones can act at extremely low doses”.
So much about the article, and here is the chemistry about BPA.
BPA has two phenol functional groups, and it is used to make polycarbonate plastic that are used in almost everything we use that are plastic, including food containers and baby bottles (in some states in the USA, BPA bottles is already banned in Canada, as well as some states in the US).
Why is BPA toxic? Because it can bind to estrogen receptors, though the binding is much weaker than the body’s own estrogen. Estrogen is assotiated with many horrible diseases, such as cancers, obesity, immune system diseases, and behavior problems. BPA could also bind to receptors for male hormone and thyroid hormone. DES (diethylstibestrol) is a disastrous example of an endocrine disruptor, as mentioned in the NYTimes article. It was used as a drug to prevent miscarriage for pregnant women in 1950s. It could bind to estrogen receptors much stronger than BPA, and has caused sever problems such as cancers, and even affects the offsprings of the womens who took it. What’s this chemical?
(Images: BPA, Estrone, and DES. Curtosey of wikipedia)
This summer I am working on a GSoC project (Gene Expression Reader plugin) for cytoscape again, and as a GSoC student, I got a chance to go to the 2010 Cytoscape summer retreat in Ann Arbor Michigan, and met several mentors, and listened to several very interesting talks.
I talked to Alex Pico, my direct mentor on the Express Reader plugin, also Scooter and Daniele. The face to face meeting turns out to be very efficient 🙂 and we clarified several uncertainties in the project design. It’s also interesting to learn some history of the Cytoscape, and link faces to some familiar names in mailing lists.
One of the interesting talk is about a cytoscape plugin that can explore open linked data (LOD), given by Eric Neumann. He talked about a SPARQL plugin that can be used to query LOD data, and gave quite a few case studies on how to use the plugin to query and visualize data that are related to drug discovery. It was related to the work that is being done in the Wild’s group I worked with last year (chem2bio2rdf). Although I am not actively involved in that project any more, it’s exciting to see that the data could be potentially put into real use.
Dr Lee Hood gave a inspiring talk on the future role of systems biology to P4 medicine (predictive, preventive, personalized, participatory). There are several other interesting talks and fun posters too.
Finally, the location of the meeting was in a big research building that was belong to Pfizer, and was recently bought to UM, but was still pretty much vacant. It’s a huge building and very nice facilities, but so quiet as we are the only group of people in it.
Update on 9/3: It seems the videos of all talks are uploaded now (http://cytoscape.wodaklab.org/wiki/CytoscapeRetreat2010)