We've got a more professional interface. Needless to say, AI (Gemini) was largely responsible for the improvement. The improvements are not limited to cosmetics...you should receive noticeably faster outputs from several of the tools (particularly "Relevant Studies", which doesn't require a lot of behind-the-scenes calculations). Despite the assistance, it still took about three weeks of work and 6,000 lines of code. The old site actually required fewer lines of code...this is not a testament to my own efficient coding, but rather, Gemini's insistence on bullet-proofing and commenting everything.
There are also several new features. The coolest is this: The "Third Study" tool outputs a nice Venn diagram that you could use for your paper. Graphics generated from bioinformatic sites are rarely, if ever, publication ready, but it's easy enough to right-click on the image and edit it in Photoshop or Illustrator. The sizes of the circles and intersecting regions correspond to the number of genes within, failing to some extent when there are only a few genes in an intersection. Click on an output study, go to the "Venn Diagram" tab, and you'll get something like this:
Here's another new feature: the "Tissue Specificity" box for the "Fisher" tool. Let's say you have a study that compares breast cancer tissue to healthy tissue. You derive a list of DEGs that are upregulated in breast cancer. You could use our Fisher tool to find knockouts, drugs, etc., that tend to downregulate these genes. However, you might suspect that these treatments could cause unpleasant systemic effects. You'd prefer to target genes that are breast-specific. The "Tissue Specificity" choice allows you to do that. Specifically, the tool looks in a table of breast-specific genes and then filters the database for studies in which these genes were specifically targeted. Though not related to tissue-specificity, we've also included a list of genes that can be targeted by existing drugs. More lists are possible.
Another feature is this: "Select Database", seen in the sidebar for several tools. Currently, there are only two choices, our "standard" database and a second database ("Reduced p10"). Here, we've simply taken the standard database and removed the top 10% of most commonly perturbed genes. It's computationally expensive to do this on the fly, thus a revised database. The revised database is an attempt to address the fact that many input studies converge on relatively few database studies. Here, commonly perturbed genes are removed in order to allow the small-time talents to shine. The idea is far from optimized...currently, it seems like removing a mere 10% of genes was probably too conservative, since outputs currently don't seem to be tremendously different for either choice of database. We've got ideas for other database alterations as well.
whatismygene.com
No comments:
Post a Comment