NIAID Banner Logo Image

User Guide

Quick Tips

  • On our home page, you will find a Simple Search option that allows you to perform a quick text search through the database for chemical or literature entries by keyword.
  • From the home page or the left-hand sidebar, you can navigate through multiple Advanced Search options, allowing you to tailor your search according to chemical properties, biological properties or information about a specific literature reference.
  • Please be patient, as searches can take up to a minute. You can keep track of your query's progress by watching the status bar in your browser window.
  • Your search session will time out after 20 minutes of inactivity.
  • We cannot provide samples of any compound described in this database. In some cases, samples may be available through the NCI/DTP Open Chemical Repository.

Simple text search

  • The home page of ChemDB features a Simple Search option.
  • After you enter a keyword, Simple Search searches all fields in our database for your keyword(s).
  • Simple Search is not case sensitive.
  • You can select to perform your Simple Search in either our reference library or compound database.

For example, if you type reverse transcriptase into the Simple Search box on our home page, leaving the Compounds option selected, and hitting Search, every field in the database will be searched for the term reverse transcriptase. Your results will include entries with the words reverse transcriptase in the "target" or "enzyme" fields. You can perform another search with the same keywords, reverse transcriptase, and select the Literature option before hitting Search. In this case, your results will come from our literature database and will include all articles with the term reverse transcriptase in the title.

Advanced search options

  • The Advanced Search page can be accessed through either the chemical or biological search portals on the left-hand sidebar of the home page.
  • The only difference between the chemical and biological portals is the initial configuration of the search page.
    • The Chemical portal is configured with expanded Compound Data and Structure Search sections.
    • The Biological portal is configured with expanded Cell Based Anti-HIV Assay Data, Anti-HIV Enzyme Inhibition Data and Anti-Opportunistic Infection Data sections.
  • If you wish to search on both biological and chemical parameters, you can expand any closed section yourself by clicking on the red icon to the left of the section heading.
  • If you minimize a section, any search criteria you entered into that section will be lost.

Boolean

Our Advanced Search tool gives you the option of executing a Boolean search. Boolean search lets you look for combinations of up to three criteria using Boolean connector terms (i.e., AND, OR, AND NOT).

  • You must select at least two search criteria before attempting to specify a Boolean connector term.
  • After you specify your search criteria, you can modify the search logic by choosing your fields and operators from the drop down lists in the Boolean Search section.
  • By default, AND is used for all search criteria combinations.
  • If you select more than three search criteria, AND will automatically be used for all connector terms.
  • Here is a detailed explanation of Boolean connector terms:
AND: Narrows search and retrieves records containing all of the fields it separates. In the example above, the AND operator returns only those results which contain both Search Field #1 and Search Field #2.AND
AND NOT: Use NOT to exclude a term. Records with the first term will be retrieved, but any records with the second term will be eliminated.AND NOT
OR: Broadens the search and retrieves records containing any of the fields it separates. In the example above, the OR operator returns all of the results of Search Field #1 and all of the results of Search Field #2 (including those which contain both).OR

Chemical Search Portal

The chemical portal opens with the Compound Data and Structure Search sections expanded.

Compound Data

  • Chemical Name: This search parameter offers a great deal of flexibility, allowing you to search for the exact name, for words the name begins with or ends with, for words the name contains, or for other names that sound like the name for which you are searching. You can search by the IUPAC name or by common or alternate names at the same time (default setting, Names and Synonyms); you can search for just the IUPAC name (Name only), or for only the common and alternate names (Synonyms only).
    The chemical names displayed on this website are generated by Accelrys’ Isentris program, using the Open Eye library and IUPAC standards. Names are generated for as many compounds as possible, but Isentris is unable to name every compound on the website. If the program fails to generate the name for any part of the structure, it displays the symbols, [?], for that portion of the compound’s name. We are unable to QC every name generated by Isentris, so if you run across an error, please let us know and we will correct our records. For peptides and oligonucleotides the sequence is used as the chemical name in place of an Isentris generated name.
  • Company: Sometimes entries contain data on a company affiliated with a certain compound, typically because the company tested the compound, markets the drug or has a patent on the compound.
  • Chemical Class: Each compound is classified based on functional groups that are part of that compound's structure.
  • Target Class: In some cases, compounds are classified based on the organism, biological process or protein that compound acts upon.
  • AIDS#: A unique identification number assigned to each chemical substance or to each combination of two or more chemical substances that constitutes one record in the Division of AIDS Anti-HIV/OI/TB Therapeutics Database.
  • NSC#: A National Cancer Institute (NCI) internal identification number assigned to each distinct chemical substance recorded in their database.
  • Molecular Formula: This is a standard scientific notation that lists the kind and number of atoms in a molecule. For example, water has two hydrogen atoms and one oxygen atom and its molecular formula is H2O. To search for molecular formula in this database, it should be entered with one space between the elements. In addition, please note that the elements of any molecular formula (which ever ones are present) should be entered only in the following order: C H Br Cl F I N O P S. For example, the proper search format for water would be H2 O.
  • CAS#: A unique identification number assigned by the Chemical Abstracts Service (CAS) to each distinct chemical substance recorded in the CAS Chemical Registry System. This number may contain up to 9 digits.
  • Molecular Weight: The sum of the atomic weights of all the atoms in a molecule. You have three options when searching molecular weight: molecules with a molecular weight equal to or less than your search entry (<=), molecules with a molecular weight equal to or greater than your search entry (>=), or a Range of molecular weights. When searching a range of molecular weights a second field will automatically appear, such that you should enter the bounds of the range into individual fields.
  • LogP: The partition coefficient for a neutral molecule calculated automatically based on the structure of the molecule by an algorithm from Symyx, described here. You have three options when searching LogP: molecules with a LogP equal to or less than your search entry (<=), molecules with a LogP equal to or greater than your search entry (>=), or a Range of LogP values. When searching a range of LogP values a second field will automatically appear, such that you should enter the bounds of the range into individual fields.
  • Lipinski Score: Christopher Lipinski and his co-authors reported a method of predicting the absorption and permeability properties of an orally administered compound based on cut-off values for four parameters: LogP (where P is the octanol-water partition coefficient), H-bond donors, H-bond acceptors and the molecular weight. The "Rule of Five" is derived from the observation that if all four parameter values for a compound are less than five (LogP and H-bond donors) or multiples of five (ten for H-bond acceptors and five hundred for molecular weight), then this compound is predicted to have good absorption or permeability properties.[1] Listed below are Lipinski's rules:
    • Not more than 5 hydrogen bond donors (nitrogen or oxygen atoms with one or more hydrogen atoms)
    • Not more than 10 hydrogen bond acceptors (nitrogen or oxygen atoms)
    • A molecular weight under 500 Daltons
    • An octanol-water partition coefficient log P of less than 5

    The SCORE calculated and posted in this database is defined as the number of parameters (out of four) that are less than or equal to the cut-off values originally proposed by Lipinski's "Rule of Five." This database shares this score to aid the user in searching for compounds that may demonstrate favorable absorption or permeability properties (oral bioavailability). Any compound with a score greater than or equal to two may have oral bioavailability. These scores are only a prediction about a compound's oral bioavailability; therefore, other criteria should be considered when determining a compound's usefulness as a drug. Exceptions will always exist, such as substrates for transporters and natural products.

Structure Search

MarvinSketch: MarvinSketch is an advanced, Java-based chemical editor for drawing chemical structures from ChemAxon. To enter a structure, click on the structure input box and the editor will pop up in a new window. A detailed user's guide for Marvin Sketch can be found by following this link.

  • SMILES string: Simplified molecular input line entry specification (SMILES) strings are linear strings that unambiguously encode the two dimensional or three dimensional structure of a compound. You can upload a structure into the MarvinSketch editor by pasting a SMILES string into the SMILES String search field and then pressing Load SMILES.

After entering or uploading a structure into MarvinSketch, you can perform a database structure search using one of these operators:

  • Similarity: Finds compounds that are structurally similar to your structure, using the degree of similarity specified.
  • Flexmatch: Finds compounds that match your structure given the flexibility option specified.
  • Substructure Search: Finds compounds that contain your structure. A substructure is a portion of a larger molecular structure.
  • Exact Match: Finds compounds that are an exact match of your structure.

Biological Search Portal

The biological portal opens with the Cell Based Anti-HIV Assay Data, Anti-HIV Enzyme Inhibition Data, and Anti-Opportunistic Infection Data sections expanded.

Cell Based Anti-HIV Assay Data

  • HIV Target: The enzyme or cellular process at which the inhibitor or drug is targeted in a cellular anti-viral assay aimed at inhibiting or preventing HIV proliferation.
  • Cell Type: Cell line used for cellular anti-HIV assay.
  • EC50: For the purposes of this database, EC50 reflects the concentration of compound necessary to inhibit HIV proliferation by 50%.
  • IC50: For the purposes of this database, IC50 reflects the concentration of compound necessary to inhibit host cell growth by 50% when describing a Cell Based Anti-HIV assay.
  • TI:: This stands for Therapeutic Index, which is a ratio of the therapeutic dose of a compound (the dose at which it inhibits target activity by 50%) and the lethal dose of a compound (the dose at which it inhibits the growth of host cells by 50%). Using our nomenclature, TI=IC50/EC50.

Anti-HIV Enzyme Inhibition Data

  • Enzyme: Historically, this field listed the name of a specific enzyme (RDDP, DDDP, RT-K103N, F227L, etc.) used in the assay. In protein binding studies, the name of the protein:ligand (GP120:CD4, CCR5:RANTES and others) is cited in this field. The database development team is currently evaluating the utility of this field. Future versions of the database may merge this field with the Target field described below.
  • Target: Historically, this field listed the name of the HIV infection- or replication-related viral or cellular component(s) and or event(s) on which the compound either shows its effect or is expected to show its effect in cell-based studies. Examples include: Reverse Transcriptase (RT), Protease (PR), Integrase (INTG), GP120, GP41, CXCR4, CCR5, CD4, NCP7, TAT, REV, RRE, etc. The database development team is currently evaluating the utility of this field. Future versions of the database may merge this field with the Enzyme field described above.
  • IC50 or Ki: This field searches both IC50 data and Ki data for Anti-HIV enzyme assays. For the purposes of this database, IC50 reflects the concentration of compound necessary to inhibit targeted enzyme activity 50% when describing an enzyme inhibition assay. Ki is the inhibitor dissociation constant. The database development team is currently evaluating the utility of this search option. Future versions of the database may separate this search field into two separate search functions.

Anti-Opportunistic Infection Data

  • Pathogen: Microorganisms and viruses targeted by a compound in a growth inhibition assay.
  • OI Target: Protein or pathway targeted by a compound in a growth inhibition assay.
  • Enzyme Inhibition: A relative rating of the activity of a compound against an enzyme. The database development team is currently evaluating the utility of this search option. Future versions of the database may remove this field.
  • Growth Inhibition: A relative rating of the activity of a compound against a pathogen. The database development team is currently evaluating the utility of this search option. Future versions of the database may remove this field.
  • In vivo activity: A relative rating of the activity of a compound against a pathogen in an in vivo model. The database development team is currently evaluating the utility of this search option. Future versions of the database may remove this field.
  • MIC or IC50: This field searches both the Minimum Inhibitory Concentration and IC50 data entered for opportunistic infections. The database development team is currently evaluating the utility of this search option. Future versions of the database may separate this search field into two separate search functions.

Literature Search Portal

  • LitRef#: A unique identification number assigned to each reference indexed in the Division of AIDS Anti-HIV/OI/TB Therapeutics Database.
  • Year: The year an article was published.
  • Author: Perform a simple text search of all authors listed in the database. Comma separated words are searched independently; for example Smith, Sam would search for Smith and Sam. In most cases, the full author list for each paper has been indexed in our database.
  • Title: Perform a simple text search of all titles in the database. It is not necessary to enter the entire title; however, entering part of a title will return all articles containing the exact sequence of words entered.
  • Journal: Journal names are primarily entered in abbreviated format, consistent with National Library of Medicine abbreviations. Searching for an unabbreviated journal name will return an incomplete list of results.
  • Volume: Journal volume in which the source article appeared.

Literature Availability

Many articles are available in PDF format through this website to NIH staff on the NIH network. Additionally, NIH staff can request paper copies of any paper referenced in this database through this contact form. We cannot provide papers to non-NIH personnel.

Surveillance Memos

We publish a biweekly literature surveillance memo that identifies relevant research published on pre-clinical experimental therapies for HIV, opportunistic infections (OIs) associated with AIDS, and other viral pathogens. Where available, URL links to article abstracts are included in the memo. Abstract links to PubMed (ncbi.nlm.nih.gov) are available to all users. Abstract links to Web of Science (http://www.isiknowledge.com) or Science Direct (sciencedirect.com) are only available to NIH staff inside the NIH firewall or to other registered users of these Websites.




[1] Christopher A. Lipinski, Franco Lombardo, Beryl W. Dominy and Paul J. Feeney, "Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings." Adv. Drug Delivery Rev., 1997, 23(1-3), 3-25.  

If using ChemDB information for publication or abstract presentation, please cite the website in addition to the original information source.    Database last updated: September 2024