Simple Search


We will illustrate a simple search by searching for proteins that are targeted by the drug Aspirin.


  • 1. Type Aspirin into the search box. Select Aspirin (Drug) from the Autosuggest drop-down menu.
  • Important: Clicking Search at this point will retrieve all known relationships to Aspirin.
  • 2. Type Protein into the search box and select Protein from the menu.
  • 3. Now select Targets as the relationship you are interested in.
  • 4. Click Search. Aspirin’s protein targets will be presented in the Protein facet in the RESULTS tab.

NOTE 1: The search term can be a concept such as Genes, Proteins, Diseases, or Drugs, or a specific name such as BRCA1, MAPK, or Sitagliptin.

NOTE 2: DistilBio interprets each query as a graph in which the search terms are the nodes and the selected relationship is the edge. The query graph can be viewed in the QUERY tab.
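
As a rough illustration of NOTE 2, the simple search above can be written down as a tiny graph. This is only a sketch: the `QueryGraph` class and its field names are hypothetical and are not DistilBio's internal representation.

```python
from dataclasses import dataclass, field


@dataclass
class QueryGraph:
    """Hypothetical model of a query: search terms are nodes, relationships are edges."""
    nodes: list = field(default_factory=list)
    edges: list = field(default_factory=list)  # (source, relationship, target) tuples

    def add_node(self, term):
        # Keep each search term only once, in the order it was entered.
        if term not in self.nodes:
            self.nodes.append(term)

    def add_edge(self, source, relationship, target):
        # Adding an edge also registers both of its endpoints as nodes.
        self.add_node(source)
        self.add_node(target)
        self.edges.append((source, relationship, target))


# The simple search from the steps above: Aspirin (Drug) --Targets--> Protein
query = QueryGraph()
query.add_edge("Aspirin (Drug)", "Targets", "Protein")

for source, relationship, target in query.edges:
    print(f"{source} --{relationship}--> {target}")
```

Each additional search term would add another node, and each selected relationship another edge.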



Link to Result

Advanced Search


DistilBio allows the user to ask complex queries and discover new and interesting connections across the data. To run an advanced search, type a “concept” and then extend the query by adding associated connections or concepts. A “concept” can be a drug, gene, protein, disease, or compound, or a property related to one of these. The autosuggest feature displays the connections available for a concept.
Currently, the “instant” mode cannot be disabled. The user can view and modify the query nodes by clicking on the arrow above the query box.
Some of the complex queries that can be asked are:

What are the common drug targets between Aspirin and Acetaminophen?
Type “Aspirin” in the search box and select the <type> drug. Aspirin is displayed in the search box with a green background, and the result interface below shows all the connections available for Aspirin. Extend the query by typing “protein” and selecting <type> protein from the drop-down; the result interface now shows only the proteins targeted by Aspirin. Now type “acetaminophen” and select <type> drug. This completes the query, and the drug targets common to Aspirin and Acetaminophen are displayed in the result interface.

Link to Result
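
Conceptually, a query with two drugs joined through a shared protein node behaves like a set intersection. The sketch below illustrates that idea only; the drug and protein names are made-up placeholders rather than real DistilBio results, and `common_targets` is a hypothetical helper.

```python
# Made-up placeholder data; these are NOT real DistilBio results.
targets = {
    "Drug A": {"Protein 1", "Protein 2", "Protein 3"},
    "Drug B": {"Protein 2", "Protein 3", "Protein 4"},
}


def common_targets(drug_a, drug_b, targets_by_drug):
    """Return the proteins targeted by both drugs (empty set if a drug is unknown)."""
    return targets_by_drug.get(drug_a, set()) & targets_by_drug.get(drug_b, set())


shared = common_targets("Drug A", "Drug B", targets)
print(sorted(shared))
```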
What are the protein targets of the drug Sitagliptin and what are the proteins interacting with these targets?
Type “Sitagliptin” in the search box and select <type> drug. Type “protein” and select <type> protein from the drop-down. Type “protein” again and select <type> protein. This displays the protein targets of the drug Sitagliptin along with the proteins that interact with those targets.

Link to Result
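
Adding “protein” twice chains two relationships: drug to target, then target to interacting protein. A minimal sketch of that two-step lookup follows, assuming made-up placeholder data and a hypothetical helper named `targets_and_interactors`; it does not reflect how DistilBio evaluates the query.

```python
# Made-up placeholder data; these are NOT real DistilBio results.
drug_targets = {"Drug A": ["Protein 1", "Protein 2"]}
interactions = {
    "Protein 1": ["Protein 5"],
    "Protein 2": ["Protein 5", "Protein 6"],
}


def targets_and_interactors(drug, drug_targets, interactions):
    """Map each target of `drug` to the proteins that interact with that target."""
    return {
        target: list(interactions.get(target, []))
        for target in drug_targets.get(drug, [])
    }


result = targets_and_interactors("Drug A", drug_targets, interactions)
for target, partners in result.items():
    print(target, "interacts with", partners)
```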
What are the compounds targeting the protein CP19A and the assays associated with those compounds? Also find assays performed only in humans.
Type “CP19A” in the search box. There are 19 matches for CP19A. To select the human protein, type “CP19A_human” or select it from the drop-down. Now type “compound” and select <type> compound. Type “assay” and select assay from the drop-down list. To find assays performed only in humans, type “human” and select <organism> human. The results are displayed instantly below.

Link to Result
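
The final “human” step acts as a filter on the assays already in the result. The sketch below shows the idea with made-up placeholder records; the `Assay` type and the `assays_in_organism` helper are hypothetical and only illustrate restricting results by organism.

```python
from typing import NamedTuple


class Assay(NamedTuple):
    assay_id: str
    compound: str
    organism: str


# Made-up placeholder records; these are NOT real DistilBio results.
assays = [
    Assay("assay-1", "Compound X", "human"),
    Assay("assay-2", "Compound X", "rat"),
    Assay("assay-3", "Compound Y", "human"),
]


def assays_in_organism(records, organism):
    """Keep only the assays performed in the given organism (case-insensitive)."""
    wanted = organism.lower()
    return [record for record in records if record.organism.lower() == wanted]


human_assays = assays_in_organism(assays, "Human")
print([record.assay_id for record in human_assays])
```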

Data


DistilBio currently covers the following databases, and we are working on adding more.

Database Connections/Content
Swissprot Protein->GO-Molecular Function, Protein->GO-Biological Process, Protein->GO-Cellular Component, Protein->Organism, Protein->Gene, Protein->Sites, Protein->Regions, Protein->Pathway, Protein->Protein Interaction, Protein->Disease, Protein->Publication, Protein->Protein Properties
Drugbank Drug->Protein, Drug->Drug Interactions, Drug->Patent, Drug->Publication, Protein->Publication, Drug->Drug Properties
OMIM Disease->Publication, Disease->Disease Properties
PharmGKB Drug->Disease, Drug->Publication, Disease->Publication
CTD Drug->Disease, Drug->Publication, Disease->Publication, Cross-Refs
Entrez Gene Gene->Gene, Gene->Disease, Gene->Organism, Gene->Publication, Gene->GO-Molecular Function, Gene->GO-Biological Process, Gene->GO-Cellular Component, Disease->Publication, Gene->Gene Properties, Cross-Refs
Chebi Compound->Patent, Compound->Publication, Compound->Compound Properties
Chembl Activity->Publication, Activity->Assay, Assay->Publication, Assay->Protein, Assay->Organism, Compound->Protein, Compound->Assay, Compound->Activity, Compound->CellLine, Compound->Organism, Compound->Publication, Compound->Tissue, Compound->Compound Properties
ChemSpider Cross-Refs for Compounds and Drugs
BioGRID Gene->Gene Interaction, Gene Interaction->Publication
IntAct Protein->Protein Interaction, Protein Interaction->Publication
MINT Protein->Protein Interaction, Protein Interaction->Publication
HomoMINT Protein->Protein Interaction, Protein Interaction->Publication
VirusMINT Protein->Protein Interaction, Protein Interaction->Publication
GO GO-Molecular Function, GO-Biological Process, GO-Cellular Component, Labels and Names
INOH Protein Classification
MeSH Mesh terms
NCBI Taxonomy Organism Labels and Names
InterPro Protein->Publication, Protein Domain->Publication, Protein->GO-Cellular Component, Protein Domain->GO-Biological Process, Protein->Protein Domain, Protein->GO-Molecular Function, Protein Domain->GO-Cellular Component, Protein->GO-Biological Process, Protein Domain->GO-Molecular Function
Pubmed Publication Titles
Cell Image Library Image->GO-Molecular Function, Image->GO-Biological Process, Image->GO-Cellular Component, Image->Imaging Mode, Image->Organism, Image->Source Of Contrast, Image->Image Type, Image->Parameters Imaged, Image->Cell Type, Image->Publication, Cell Type->Image, Image->Visualization Method, Cross-Refs
Cosmic* Cell Line->Publication, Cell Line->Tissue, Cell Line->Disease, Cell Line->Gene, Mutation->Publication, Mutation->Gene, Mutation->Cell Line
CGP* Cell Line->Tissue, Cell Line->Disease, Cell Line->Gene, Cell Line->Publication, Mutation->Gene, Mutation->Cell Line, Mutation->Publication, Cross-Refs
NCI* Protein->Disease, Protein->Drug, Gene->Drug, Gene->Publication, Gene->Disease, Disease->Publication, Protein->Publication, Drug->Publication
NCI-DTP* Screening Study->Cell Line, Compound->Cell Line, Drug->Cell Line, Drug->Screening Study, Compound->Screening Study, Disease->Publication, Drug->Publication, Drug->Disease, Cross-Refs
CCLE* Drug->Screening Study, Gene->Mutation, Gene->Cell Line, Cell Line->Tissue, Cell Line->Disease, Cell Line->Mutation, Mutation->Tissue, Compound->Screening Study, Screening Study->Cell Line
ICGC* Mutation->Gene, Disease->Gene, Disease->Patient, Patient->Specimen, Specimen->Specimen Properties, Mutation->Mutation Properties, Patient->Patient Properties, Cross-Refs
Methdb* Gene->GenomicRegion, GenomicRegion->Experiment, GenomicRegion->MethylationPattern, GenomicRegion->Tissue, GenomicRegion->MethylationProfile, GenomicRegion->MethylationContent, Experiment->Experiment Properties, GenomicRegion->GenomicRegion Properties, MethylationContent->MethylationContent Properties, MethylationPattern->MethylationPattern Properties, MethylationProfile->MethylationProfile Properties
String* OrthologousGroup->Protein, Protein->Protein Orthologs

* Newly added databases.
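
To see which databases supply a given connection, the table can be treated as a simple lookup. The sketch below copies only four rows from the table above, so it is not the complete coverage list, and the `databases_for` helper is hypothetical.

```python
# A few rows copied from the table above (not the complete table).
coverage = {
    "Drugbank": [
        "Drug->Protein", "Drug->Drug Interactions", "Drug->Patent",
        "Drug->Publication", "Protein->Publication", "Drug->Drug Properties",
    ],
    "OMIM": ["Disease->Publication", "Disease->Disease Properties"],
    "PharmGKB": ["Drug->Disease", "Drug->Publication", "Disease->Publication"],
    "CTD": ["Drug->Disease", "Drug->Publication", "Disease->Publication", "Cross-Refs"],
}


def databases_for(connection, coverage):
    """Return, in alphabetical order, the databases that list the given connection."""
    return sorted(db for db, connections in coverage.items() if connection in connections)


print(databases_for("Drug->Disease", coverage))  # prints ['CTD', 'PharmGKB']
```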

Drug
Connections
  • Drug-Disease
  • Drug-Protein
  • Drug-Gene
  • Drug-Drug
  • Drug-Publication
  • Drug-Patent
Data Properties
  • AHFS code
  • ATC code
  • Biotransformation
  • Brand name
  • CAS number
  • Category
  • Clearance
  • Description
  • Disease indication
  • Group
  • Inchi
  • Inchi key
  • IUPAC
  • Half life
  • Mechanism of action
  • Molecular weight
  • Protein binding
  • Route of elimination
  • SMILES
  • Absorption
  • Affected Organism Name
  • Brand Mixtures
  • Food Interactions
  • LogP
  • Manufacturer
  • Molecular Formula
  • Pharmacodynamics
  • pKa
  • Volume Of Distribution
  • Water Solubility
Disease
Connections
  • Disease-Drug
  • Disease-Protein
  • Disease-Gene
  • Disease-Publication
  • Disease-Image
  • Disease-Patient
Data Properties
  • Animal model
  • Biochemical features
  • Clinical features
  • Description
  • Diagnosis
  • Inheritance
  • Pathogenesis
Gene
Connections
  • Gene-Protein
  • Gene-Disease
  • Gene-Drug
  • Gene-Pathway
  • Gene-Organism
  • Gene-Publication
  • Gene-Gene
  • Gene-GOTerms
  • Gene-GenomicRegion
Data Properties
  • Chromosomal location
  • Description
  • Map location
  • Symbol
Protein
Connections
  • Protein-Disease
  • Protein-Drug
  • Protein-Pathway
  • Protein-Protein
  • Protein-Publication
  • Protein-GO
  • Protein-Compound
  • Protein-Assay
  • Protein-Activity
  • Protein-Gene
  • Protein-ProteinDomain
  • Protein-ActiveSite
  • Protein-BindingSite
  • Protein-Calcium Binding Region
  • Protein-Coiled-Coil Region
  • Protein-Compositional Bias Region
  • Protein-DNA Binding Region
  • Protein-Glycosylation Site
  • Protein-Intramembrane Region
  • Protein-Lipid Binding Region
  • Protein-Metal Binding Site
  • Protein-Motif
  • Protein-Mutagenesis Site
  • Protein-Nucleotide Binding Region
  • Protein-Transmembrane Region
  • Protein-Zinc Finger Region
  • Protein-Protein(Orthologs)
Data Properties
  • Sequence
  • Sequence Length
  • Sequence Mass
  • Sequence Similarity
  • Function
  • Related Gene Name
  • Related Pathway Description
  • Related Polymorphism Description
  • Related Disease Description
  • Recommended FullName
  • Alternative FullName
  • Recommended ShortName
  • Alternative ShortName
  • Induction Description
  • Developmental Stage Expression Description
  • Subcellular Location Description
  • PTM
  • Tissue-Specificity Description
  • Subunit Structure Description
  • Domain Description
  • Cofactor Description
  • Disruption Phenotype Description
  • Enzyme Regulation Description
  • RNA-Editing Description
  • Catalytic Activity
  • KM
  • pH-Dependence
  • Vmax
  • Temperature-Dependence
  • Redox Potential
  • Pharmaceutical Use
  • Biotechnological Use
  • Allergen Property
  • Toxic Dose
Compound
Connections
  • Compound-Assay
  • Compound-Patent
  • Compound-Cell line
  • Compound-Tissue
  • Compound-Organism
  • Compound-Activity
  • Compound-Publication
Data Properties
  • Acidic pKa
  • ALogP
  • Basic pKa
  • Canonical SMILES
  • Charge
  • Chemical structure
  • Description
  • H-bond acceptor
  • H-bond donor
  • Inchi
  • Inchi key
  • IUPAC
  • Log D
  • Log P
  • Mass
  • Molecular formula
  • Molecular species
  • Molecular weight
  • SMILES
  • Standard Inchi
  • Standard Inchi key
Mutation
Connections
  • Mutation-Gene
  • Mutation-Tissue
  • Mutation-Cell line
  • Mutation-Publication
Data Properties
  • Somatic Status
  • GRCh genome Position
  • NCBI genome Position
  • CDS Mutation
  • Codon Change
  • Amino Acid Mutation
  • Validation platform
  • Sequence Coverage
  • Validation status
  • Read count
  • Platform
  • Tumour genotype
  • Reference genome allele
  • Consequence type
  • Nucleotide change
  • Type
Cell Line
Connections
  • Cell Line-Tissue
  • Cell Line-Disease
  • Cell Line-Mutation
  • Cell Line-Gene
  • Cell Line-Publication
Data Properties
  • Expression Array
  • SNP Array
  • Patient Age
  • Patient Gender
  • Institute Address

Examples


Sample queries to get you started. Click on the links to try them out!
  • Aspirin - drug details

    Complete relationship profile and details for Aspirin

  • Aspirin's protein targets

    Which proteins does aspirin target?

  • Letrozole - compound details

    All information about the compound Letrozole

  • Compounds targeting CP19A Protein

    All compounds targeting protein CP19A and their associated assays

  • Sitagliptin targets

    Find the target proteins of drug Sitagliptin

  • Sitagliptin targets and their interaction

    Find the proteins interacting with the target proteins

  • Common drug targets

    Do aspirin and acetaminophen have any protein targets in common?

Frequently Asked Questions


Search

1. How do I get started?
2. How complex can my query be?
3. What does the graph above the search box represent?
4. Why do the results change as I build my query?
5. Can I use the results to make a new query?
6. How do I clear a query?
7. Where can I find my recent searches?

Results

1. Can I have a brief overview of the results page?
2. How do I minimize/maximize the facets?
3. How do I select items listed in a facet?
4. What are the various ways of filtering the results?
5. I’d like to see the query graph in the results page. Where is it?
6. Why can’t I find any data for my query?
7. How do I see more results than the ones displayed in the facet?
8. How do I find evidence for a triple?
9. What are the various connections available?
10. Why don’t I see some of the results I found previously?
11. How are the publications shown in the Evidence Card relevant?
12. Can I view the structure of a protein?

Data

1. What are the databases currently covered by DistilBio?
2. What are the relationships currently covered by DistilBio?
3. How often do you update the data?
4. Does DistilBio curate data?
5. What does “DistilBio inferred” mean?
6. What does the connection "associated" mean?
7. What does the connection "related" mean?
8. Where can I find the source of the data?
9. What if I find an error in the data?
10. Can I suggest a database to be included in DistilBio?