[DATA_SOURCE]

Locations and names of database dumps to fetch and load locally

[DATA_SOURCE]
  ENSEMBL_DB_URL = ftp://ftp.ensembl.org/pub/current_mysql/
  ENSEMBL_DB_REPLACE =
  ENSEMBL_DBS = [ ensembl_accounts ]

  EG_DB_URL = ftp://ftp.ensemblgenomes.org/pub/current/pan_ensembl/mysql/
  EG_DB_REPLACE = 1
  EG_DBS = [ ncbi_taxonomy ensembl_website_84 ]

  SPECIES_DB_URL = ftp://ftp.ensemblgenomes.org/pub/current/metazoa/mysql/
  SPECIES_DB_REPLACE =
  SPECIES_DB_AUTO_EXPAND =
  SPECIES_DBS = [ bombyx_mori_core_31_84_1 ]

  MISC_DB_URL =
  MISC_DB_REPLACE =
  MISC_DBS =
  • ENSEMBL_DB_URL - the URL containing the Ensembl database dumps
  • ENSEMBL_DB_REPLACE - a flag to specify whether to overwrite databases that already exist on the DB_HOST
  • ENSEMBL_DBS - a space-separated list of database dump names in square brackets. ensembl_accounts is required; all others are optional
  • The equivalent variables (EG_DB_URL, EG_DB_REPLACE and EG_DBS) may be set to fetch EnsemblGenomes database dumps, and MISC_DB_URL, MISC_DB_REPLACE and MISC_DBS support situations where the required databases are spread across multiple hosts.
  • SPECIES_DB_AUTO_EXPAND - an additional variable for species databases: a space-separated list of database types, each used as a replacement string for core in the database names, so that multiple database types can be downloaded for each species in SPECIES_DBS
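
The name substitution described for SPECIES_DB_AUTO_EXPAND can be illustrated with a short Python sketch. This is only an illustration of the replacement idea, not the tool's actual implementation: the function name expand_species_dbs is hypothetical, the types variation and funcgen are example values, and keeping the original core database in the result is an assumption.

```python
def expand_species_dbs(species_dbs, auto_expand):
    """Return each database name plus one variant per type in auto_expand.

    Hypothetical helper: mimics replacing "core" in a database name with
    each listed database type.
    """
    expanded = []
    for name in species_dbs:
        # Assumption: the core database itself is still fetched.
        expanded.append(name)
        if "_core_" not in name:
            # Nothing to substitute in names without a core component.
            continue
        for db_type in auto_expand:
            # Replace only the first "_core_" so the rest of the name is untouched.
            expanded.append(name.replace("_core_", f"_{db_type}_", 1))
    return expanded


if __name__ == "__main__":
    # Example values only; whether these dumps exist at SPECIES_DB_URL is not checked.
    for db in expand_species_dbs(["bombyx_mori_core_31_84_1"], ["variation", "funcgen"]):
        print(db)
```

Under these assumptions, a single entry in SPECIES_DBS yields one database name per listed type in addition to the core database, so the species only needs to be listed once.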