Resources - Data

DAT1

Chemical Entities of Biological Interest (ChEBI) data. last accessed: 17-Nov-2014. Chemical Entities of Biological Interest (ChEBI) is a freely available dictionary of molecular entities focused on small chemical compounds. ChEBI is part of European Bioinformatics Institute (EBI) and European Molecular Biology Laboratory (EMBL). URL: http://obofoundry.org/cgi-bin/detail.cgi?id=chebi.

DAT2

Climate and Forecast (CF) metadata. last accessed: 17-Nov-2014. Data provided by Lawrence Livermore National Laboratory under contract with US Department of Energy. URL: http://cfconventions.org/standard-names-26.html.

DAT3

DBpedia data. last accessed: 17-Nov-2014. DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. URL: http://dbpedia.org/About.

DAT4

Encyclopedia of Life (EOL) data. last accessed: 17-Nov-2014. Encyclopedia of Life (EOL) is a free, online collaborative encyclopedia intended to document all of the 1.9 million living species known to science. It aims to build one page for each species, including video, sound, images, graphics, text. Initially, project was sponsored by MacArthur Foundation and Sloan Foundation. Additional sponsors are Field Museum, Harvard University, Marine Biological Laboratory, Missouri Botanical Garden and Smithsonian Institution. URL: http://www.eol.org.

DAT5

European Union E Number data. last accessed: 17-Nov-2014. E numbers are codes for substances which can be used as food additives for use within the European Union and Switzerland (the E stands for Europe). They are commonly found on food labels throughout the European Union. URL: http://en.wikipedia.org/wiki/E_numbers.

DAT6

International Union of Basic and Clinical Pharmacology (IUPHAR) ligands and targets data. last accessed: 17-Nov-2014. IUPHAR ligand data is about a substance that forms a bond with a biomolecule to serve a biological purpose. In protein-ligand binding, ligand usually is a signal-triggering molecule, binding to a site on a target protein. In DNA-ligand binding studies, ligand is usually a small molecule, ion or protein that binds to the DNA double helix. Ontomatica imports ligand and associated target data from International Union of Basic and Clinical Pharmacology (IUPHAR). URL: http://www.guidetopharmacology.org/.

DAT7

Kyoto Encyclopedia of Genes and Genomes (KEGG) data. last accessed: 17-Nov-2014. Kyoto Encyclopedia of Genes and Genomes (KEGG) is a collection of manually curated databases dealing with genomes, biological pathways, diseases, drugs, and chemical substances. KEGG is utilized for bioinformatics research and education, including data analysis in genomics, metagenomics, metabolomics and other omics studies, modeling and simulation in systems biology, and translational research in drug development. URL: http://www.genome.jp/kegg/.

DAT8

MeatTrack data. last accessed: 17-Nov-2014. MeatTrack provide standardized identification and tracking systems for variable measure meat products. Food retailers can use MeatTrack codes for identifying and labeling products, and promote product safety systems. URL: http://www.meattrack.com/.

DAT9

Pest Tracker data. last accessed: 17-Nov-2014. Data provided by Purdue University. URL: http://pest.ceris.purdue.edu/index.php.

DAT10

Plant Health data. last accessed: 17-Nov-2014. Data provided by USDA Animal and Plant Health Inspection Service (APHIS). URL: http://www.aphis.usda.gov/wps/portal/aphis/home.

DAT11

Plant Protection Thesaurus data. last accessed: 17-Nov-2014. Data provided by European and Mediterranean Plant Protection Organization (EPPO). URL: http://www.eppo.org.

DAT12

Plant Viruses data. last accessed: 17-Nov-2014. Data provided by Zhejiang Academy of Agricultural Sciences, People's Republic of China. URL: http://www.dpvweb.net/index.php.

DAT13

RCSB Protein Data Bank (PDB) data. last accessed: 17-Nov-2014. Protein Data Bank (PDB) is a repository for the three-dimensional structural data of large biological molecules, such as proteins and nucleic acids. Data is submitted by biologists and biochemists from around the world and is freely accessible on the Internet. PDB is a key resource in areas of structural biology, such as structural genomics. URL: http://www.rcsb.org/pdb/home/home.do.

DAT14

Royal Society of Chemistry ChemSpider data. last accessed: 17-Nov-2014. ChemSpider is a chemical database owned by the Royal Society of Chemistry. The database contains more than 30 million unique molecules from over 450 data sources. URL: http://www.chemspider.com/.

DAT15

Semantic Web for Earth and Environmental Terminology (SWEET) data. last accessed: 17-Nov-2014. Data provided by Jet Propulsion Laboratory, California Institute of Technology. URL: http://sweet.jpl.nasa.gov/.

DAT16

UN FAO AGRIS data. last accessed: 17-Nov-2014. AGRIS is part of the CIARD (Coherence in Information for Agricultural Research for Development) initiative to create a community for efficient knowledge sharing in agricultural research and development. AGRIS archives and bibliographical databases cover many aspects of agriculture, including forestry, animal husbandry, aquatic sciences and fisheries, and human nutrition from over 100 participating countries. URL: http://agris.fao.org/.

DAT17

UN FAO AGROVOC data. last accessed: 17-Nov-2014. AGROVOC is the corporate thesaurus of the Food and Agriculture Organization of the United Nations (FAO). It covers topics related to the interest of FAO, including agriculture, forestry, fisheries, environment and related domains. AGROVOC is a multilingual resource, available in 19 languages (translations into 5 languages is under development). It contains an average of 40,000 terms in each of the available languages. URL: http://aims.fao.org/standards/agrovoc/functionalities/search.

DAT18

UN FAO CODEX Alimentarius data. last accessed: 17-Nov-2014. Codex Alimentarius (Latin for Book of Food) is a collection of internationally recognized standards, codes of practice, guidelines and other recommendations relating to foods, food production and food safety. The Codex Alimentarius is recognized by the World Trade Organization as an international reference point for the resolution of disputes concerning food safety and consumer protection. URL: http://www.codexalimentarius.net/web/index_en.jsp.

DAT19

UN FAO CODEX INS data. last accessed: 17-Nov-2014. Codex General Standard for Food Additives (GSFA) sets forth the conditions under which permitted food additives may be used in all foods, whether or not they have previously been standardized by Codex. A database provides, in a searchable format, all the provisions for food additives that have been adopted by the Codex Alimentarius Commission. Provisions are searchable by food additive (name, synonym, INS number), by functional class and by food category. URL: http://www.codexalimentarius.org/standards/gsfa/.

DAT20

UN FAO INFOODS data. last accessed: 17-Nov-2014. INFOODS is the International Network of Food Data Systems. It is a worldwide network of food composition experts aiming to improve the quality, availability, reliability and use of food composition data. INFOODS also stands as a forum through which international harmonization and support for food composition activities can be achieved and advocated. URL: http://www.fao.org/infoods/infoods/en/.

DAT21

US EPA Pesticide Code (PC) data. last accessed: 17-Nov-2014. US EPA pesticide chemical codes (PC codes) identify chemical substances that are pesticides. They also identify bio-chemical substances (a.k.a. bio-pesticides), precise atomic varieties of conventional pesticides (i.e. formulations), and degradates formed from regulated pesticides. URL: http://www.epa.gov/opp00001/foia/list/clearedpccode02.htm.

DAT22

US Integrated Taxonomic Information System (ITIS) data. last accessed: 17-Nov-2014. Integrated Taxonomic Information System (ITIS) provides consistent and reliable information on taxonomy of biological species. Database draws from a large community of taxonomic experts. Smithsonian National Museum of Natural History provides content staff. US Geological Survey provides IT services. ITIS primary focus is North American species, but many groups are worldwide and ITIS collaborates with other international agencies to increase global coverage. URL: http://www.itis.gov/.

DAT23

US Library of Congress Subject Headings (LCSH) data. last accessed: 17-Nov-2014. US Library of Congress Subject Headings (LCSH) comprise a thesaurus (a.k.a. controlled vocabulary) of subject headings, maintained by the United States Library of Congress, for use in bibliographic records. LCSH are an integral part of bibliographic control, which is the function by which libraries collect, organize and disseminate documents. LCSHs are applied to every item within a library's collection, and facilitate a user's access to items in the catalog that pertain to similar subject matter. URL: http://id.loc.gov/authorities/subjects.html.

DAT24

US NIH Medical Subject Headings (MeSH) data. last accessed: 17-Nov-2014. NIH Medical Subject Headings (MeSH) is a comprehensive controlled vocabulary for indexing journal articles and books in the life sciences. MeSH is used by the MEDLINE/PubMed article database and by NLM's catalog of book holdings. URL: http://www.nlm.nih.gov/mesh/mtr_abt.html.

DAT25

US NIH NCBI BioSystems data. last accessed: 17-Nov-2014. NCBI BioSystems data is a group of molecules that interact in a biological system. One type of biosystem is a biological pathway, which can consist of interacting genes, proteins and small molecules. Another type of biosystem is a disease, which can involve components such as genes, biomarkers, and drugs. The NCBI BioSystems database (1) serves as a centralized repository of data; (2) connect biosystem records with associated literature, molecular, and chemical data; and (3) facilitate computation on biosystems data. URL: http://www.ncbi.nlm.nih.gov/biosystems.

DAT26

US NIH NCBI Macromolecular Modeling Database (MMDB) data. last accessed: 17-Nov-2014. NCBI Macromolecular Modeling Database (MMDB) is derived from Protein Data Bank (PDB). Value-added features include explicit chemical graphs, computationally identified 3D domains that are used to identify similar 3D structures, as well as links to literature, similar sequences, information about chemicals bound to the structures and more. URL: http://www.ncbi.nlm.nih.gov/structure/.

DAT27

US NIH NCBI PubChem data. last accessed: 17-Nov-2014. NCBI PubChem is a database of chemical molecules and their activities against biological assays. Millions of compound structures and descriptive datasets can be freely downloaded via FTP. PubChem contains substance descriptions and small molecules with fewer than 1000 atoms and 1000 bonds. URL: http://pubchem.ncbi.nlm.nih.gov/.

DAT28

USDA ARS Food and Nutrient Database for Dietary Studies (FNDDS) data. last accessed: 17-Nov-2014. USDA ARS Food and Nutrient Database for Dietary Studies (FNDDS) is a database of foods, nutrient values, and weights for typical food portions. FNDDS is used to analyze data from the survey What We Eat in America (WWEIA), the dietary intake component of National Health and Nutrition Examination Survey (NHANES). USDA National Nutrient Database for Standard Reference (SR) is source of underlying food composition data. URL: http://www.ars.usda.gov/services/docs.htm?docid=12089.

DAT29

USDA ARS Germplasm Resources Information Network (GRIN) data. last accessed: 17-Nov-2014. USDA ARS Germplasm Resources Information Network (GRIN) is an online database of plant, insect, microbial and animal species germplasm. Database manages taxonomic information and common names on more than 500,000 accessions (distinct varieties, cultivars, etc.) of plants covering 10,000 species. URL: http://www.ars-grin.gov/.

DAT30

USDA ARS GrainGenes data. last accessed: 17-Nov-2014. Data provided by USDA ARS GrainGenes: A Database for Triticeae and Avena. URL: http://wheat.pw.usda.gov/GG2/index.shtml.

DAT31

USDA ARS National Nutrient Database for Standard Reference (SR) data. last accessed: 17-Nov-2014. USDA ARS National Nutrient Database for Standard Reference (SR) manages nutrient information on over 8,000 foods. Information is organized by food item, group, or list to find the nutrient information for your food items. URL: http://ndb.nal.usda.gov/.

DAT32

USDA NRCS PLANTS data. last accessed: 17-Nov-2014. Data provided by USDA, NRCS National Plant Data Team, Greensboro, NC 27401-4901 USA. URL: http://plants.usda.gov.

DAT33

USDA NRCS Soils data. last accessed: 17-Nov-2014. Data provided by Soil Survey Staff, Natural Resources Conservation Service, USDA. URL: http://www.nrcs.usda.gov/wps/portal/nrcs/site/soils/home/.

DAT34

University of California, Davis (UC Davis) Marker Assisted Selections in Wheat (MAS Wheat) data. last accessed: 17-Nov-2014. Data provided by USDA National Institute of Food and Agriculture (NIFA) and Department of Plant Sciences, University of California, Davis. URL: http://maswheat.ucdavis.edu/Index.htm.

DAT35

Virus Taxonomy data. last accessed: 17-Nov-2014. Data derived from Virus taxonomy: classification and nomenclature of viruses: Ninth Report of the International Committee on Taxonomy of Viruses. (2012) Ed: King, A.M.Q., Adams, M.J., Carstens, E.B. and Lefkowitz, E.J. San Diego: Elsevier. URL: http://www.ictvonline.org/.