Config file

File location

The config file is in Git: https://github.com/pombase/pombase-config/blob/master/website/pombase_v2_config.json

The following setting are available:

`extension_display_names`

To appear in the "Target of" part of the gene page the annotation must have a extension relation that matches a "rel_name" in this section. Some items have a "if_descendent_of" setting. If that is present then the term in the annotation must be a descendent of the configured term.

eg.

{
  "rel_name": "has_input",
  "if_descendent_of": "GO:0003824",
  "display_name": "has substrate",
  "reciprocal_display": "substrate of"
},

This will match any annotation where the term is a descendent of "GO:0003824" and has "has_direct_input" as a extension relation.

The string in the "reciprocal_display" field is used in the "Relationship" column of the target of gene page section.

The first matching configuration (via if_descendent_of) will be used.

`predefined_queries`

Preconfigured links to the query builder and "Canned queries" in the query builder are configured here.

The format is:

  "<QUERY_NAME>": {
    "name": "<USER_FRIENDLY_NAME>",
    "constraints": "<QUERY_CONSTRAINTS>"
  }

The <QUERY_NAME> can be any string as long it's unique.

The <USER_FRIENDLY_NAME> is a short title or description of the query that can be shown to users. For canned queries (see canned_query_ids below) this friendly name is shown in the "Canned queries" section of the query builder. For other predefined queries the name is currently not used.

The <QUERY_CONSTRAINTS> is the internal JSON representation of the query. This text is available via a hidden menu to the right of each query.

example:

  "physical_interactors_of_wee1": {
    "constraints":{
      "interactors":{
        "gene_uniquename": "SPCC18B5.03",
        "interaction_type": "physical"
      }
    }
  }

Whitespace outside of the quotes isn't significant and the newlines are optional.

This can be used in a query in the documentation like this. The <QUERY_NAME> from predefined_queries is the predefinedQueryName in the query-link, e.g.

<app-query-link [goToResults]="true" [linkText]="'Physical interactors of wee1'"
    [predefinedQueryName]="'physical_interactors_of_wee1'">
</app-query-link>

See Links to the query builder for more on query links.

`reference_page_extra_links`

Extra links to show on reference page. The format is:

"PMID:...": [
   {
     "link": "SOME_URL",
     "text": "LINK_LABEL"
   },
   ...
],
...

`canned_query_ids`

`front_page_panels`

A list of settings for the front page panels. The configuration for each panel is chosen randomly from this list on each visit to the front page.

`title`

The text to include in the title of the panel. Required.

`title_link`

Optional external link URL.

`panel_type`

One of "explore", "community" or "spotlight". Required.

`head_image`

A list of images for the top of this panel. The image will be chosen at random from this list on each visit to the front page. Required.

`content`

Markdown content for this panel. Required.

`link`

A link for the bottom of the panel. Optional.

`link_label`

The text to use for the link, required when link is configured.

`show_on_front_page`

If true, this panel config is a candidate for the front page. If false the panel is only visible in the archive. Required.

`external_gene_references`

See also "Adding a new external reference".

Sub-fields of external_gene_references:

`ref_type`

The string to appear in the first column of the external references table

`feature_types`

This reference will only appear on gene page with these types. Some be an array with the values "mRNA gene" and/or "pseudogene". It can also be "ALL" so it will appear on all pages including RNA gene pages.

`name`

The string to appear in column 2.

`description`

Column 4 contents.

`field_name`

The field of the gene to substitute into the configured URL.

Possibilities:

uniquename: the systematic ID
name: the gene name
uniprot_identifier
orfeome_identifier

and the special case: NCBI_ALL_IDS which will create a string containing name, uniquename and synonyms suitable for a link to PubMed.

`url`

The URL of the external site. The <<IDENTIFIER>> in the string will be replaced with the value of the field set with the field_name setting.

`go_xrf_abbrev`

Instead of providing a url you can provide a database abbreviation that matches an entry in GO.xrf_abbs

`extension_relation_order`

`evidence_types`

`cv_config`

There is a configuration item here for each ontology annotation section on the gene pages.

Attributes:

`feature_type`

This defaults to "gene" and is set to "genotype" for the phenotype annotation sections.

`display_name`

The title to use at the top of the section. eg. "Single allele phenotype"

`filters`

The filters item is a list of the configuration for the filters that should shown for this section.

Attributes:

`display_name`

The name for the filter that is shown to the user.

`filter_type`

A unique identifier for this filter. eg. "evidence"

`detailed_view_only`

Should be "true" for filters the should only be shown in the detail (non-summary) view

`term_categories`

For term type filters this is used to group the annotation by ancestor term. Each category has:

a display_name like "Normal process"
an ancestors list with the ancestor term ID and the relation to follow to get to the ancestor

There can be multiple ancestors per category. The category will contain all terms that have any of the configured ancestors.

`evidence_categories`

Each has:

a display_name like "Western blot"
an evidence_codes list

The evidence codes can be any case. They shouldn't have the word "evidence" at the end. eg. use "Western blot" not "Western blot evidence". That's because we have both versions in Chado (needs fixing) so the web code checks for annotations with and without the " evidence" at the end as the evidence code.

`modification_abbreviations`

Config for abbreviations in the "modified residue" relation on modification extensions.

See: https://github.com/pombase/pombase-chado/issues/1141#issuecomment-1972530354

`summary_relations_to_hide`

`summary_gene_relations_to_collect`

`extra_database_aliases`

This map contains aliases for databases that aren't available in GO.xrf_abbs. These aliases are also used for fixing the database names for output in ProteinFeatures.tsv.

The keys should be lower case and the code will then check for any case. eg. for the configuration "amigo": "GO" the code will look for "amigo", "AMIGO" or "AmiGO" and map it to "GO".

`slims`

This is a map from slim name to slim config. The keys are something like pombe_mondo_slim and the config details are:

`slim_display_name`

Something like "Disease slim" to show as a user friendly name for the slim on the website.

`doc_path`

The path to the documentation for this slim.

`full_slim_path`

The path in the website for this overview path of this slim

`description`

Used when a bit more detail is needed.

`slim_results_description`

Text to added at the top of the results page when a gene set is slimmed. Example: https://www.pombase.org/slim:bp_goslim_pombe/from/id/ae2c8f19-6d8c-43bb-a2ac-dc81665b7292

`cv_name`

The name of the ontology that underlies this slim.

`terms`

This is a list of the top level terms / slim categories. Each element is a term name (name) and a term ID (termid).

`term_display_names`

A map from term name to a tidier name for display purposes. Currently used for the misc. sequence feature table. See: https://github.com/pombase/website/issues/37#issuecomment-765301311

`external_term_link_prefixes`

A list of term prefixes (like "CHEBI") for terms that should be external links rather than links to term details pages. The prefixes need to be added to the "ABBREVS" variable in update_generated_files.sh. The IDs in ABBREVS must be in the GO db-xref file - add a pull request for any missing databases.

`sequence_feature_page`

Configuration for the sequence feature page. See: https://github.com/pombase/website/issues/37

`so_types_to_show`

The SO type names to show on the sequence feature page.

`target_of_config`

Configuration for the "Target of" section on the gene pages.

`ontology_labels`

The text to use in the "Ontology" column of the "Target of" table.

`relation_priority`

These relation priorities determine the order of rows in the table. Rows with an extension relation with a higher priority will appear closer to the top. If a priority isn't configured it defaults to 0.

`file_exports`

Configuration for files exported at the end of the nightly load. These files mostly end up in the "misc" directory.

`site_map_term_prefixes`

We create a sitemap.txt for Google. Only pages for terms starting with these term prefixes are written to that file.

`site_map_reference_prefixes`

References with these IDs are written to sitemap.txt.

`macromolecular_complexes`

Config for writing Complex_annotation.tsv

`parent_complex_termid`

Only annotation for terms that are descendants of this are written.

`excluded_terms`

Terms that aren't written, even if they are descendants of parent_complex_termid.

`rnacentral`

Config for writing rnacentral.json

`export_so_ids`

Only details for genes with these SO types (RNA terms) are written to rnacentral.json

`annotation_subsets`

The exporter has basic support for writing annotation files where the subset and columns to write can be configured. The output files are TSV format.

`term_ids`

Constrains the term IDs of the annotation. Only annotation with one of these terms as the parent will be written.

`file_name`

Name of the file written to the misc directory. eg. "increased_sensitivity_to_chemical.tsv"

`single_or_multi_locus`

Write only the annotation from "single" or "multi" allele genotypes.

`columns`

The column names and types to write to the TSV file, retrieved from the annotation data. name can be "gene_uniquename", "gene_name", "allele", "termid" or "term_name".

The display_name is the column heading written to the TSV file.

`protein_feature_view`

Configuration for the protein feature viewer (eg. https://www.pombase.org/gene_protein_features/SPBC28F2.12)

`modification_extension_rel_types`

A list of extension relation types that should be shown when the user moves the mouse over a modification in the protein feature viewer.

`widget_track_names`

The names of the tracks in the feature viewer.

`modification_groups`

Each of these groups will get a different colour in the feature viewer. The colours are currently hard coded here: https://github.com/pombase/pombase-python-web/blob/master/protein_feature_view/templates/protein_feature_view/index.html

PomBase is funded by the Wellcome Trust

Config file

File location

extension_display_names

predefined_queries

reference_page_extra_links

canned_query_ids

front_page_panels

title

title_link

panel_type

head_image

content

link

link_label

show_on_front_page

external_gene_references

ref_type

feature_types

name

description

field_name

url

go_xrf_abbrev

extension_relation_order

evidence_types

cv_config

feature_type

display_name

filters

display_name

filter_type

detailed_view_only

term_categories

evidence_categories

modification_abbreviations

summary_relations_to_hide

summary_gene_relations_to_collect

extra_database_aliases

slims

slim_display_name

doc_path

full_slim_path

description

slim_results_description

cv_name

terms

term_display_names

external_term_link_prefixes

sequence_feature_page

so_types_to_show

target_of_config

ontology_labels

relation_priority

file_exports

site_map_term_prefixes

site_map_reference_prefixes

macromolecular_complexes

parent_complex_termid

excluded_terms

rnacentral

export_so_ids

annotation_subsets

term_ids

file_name

single_or_multi_locus

columns

protein_feature_view

modification_extension_rel_types

widget_track_names

modification_groups

Clone this wiki locally

`extension_display_names`

`predefined_queries`

`reference_page_extra_links`

`canned_query_ids`

`front_page_panels`

`title`

`title_link`

`panel_type`

`head_image`

`content`

`link`

`link_label`

`show_on_front_page`

`external_gene_references`

`ref_type`

`feature_types`

`name`

`description`

`field_name`

`url`

`go_xrf_abbrev`

`extension_relation_order`

`evidence_types`

`cv_config`

`feature_type`

`display_name`

`filters`

`display_name`

`filter_type`

`detailed_view_only`

`term_categories`

`evidence_categories`

`modification_abbreviations`

`summary_relations_to_hide`

`summary_gene_relations_to_collect`

`extra_database_aliases`

`slims`

`slim_display_name`

`doc_path`

`full_slim_path`

`description`

`slim_results_description`

`cv_name`

`terms`

`term_display_names`

`external_term_link_prefixes`

`sequence_feature_page`

`so_types_to_show`

`target_of_config`

`ontology_labels`

`relation_priority`

`file_exports`

`site_map_term_prefixes`

`site_map_reference_prefixes`

`macromolecular_complexes`

`parent_complex_termid`

`excluded_terms`

`rnacentral`

`export_so_ids`

`annotation_subsets`

`term_ids`

`file_name`

`single_or_multi_locus`

`columns`

`protein_feature_view`

`modification_extension_rel_types`

`widget_track_names`

`modification_groups`