Yeast Rank Response Configuration¶

The yeast_rank_response tool requires a json configuration file as input. The configuration file specifies the location of the input files and various other settings.

Below, you will find first an example and next a table defining each key. In the definitions table, if a key/value pair is not marked as [REQUIRED], then there is a default which is set in the tool. [REQUIRED] key/value pairs must be present.

Example¶

{
  "binding_data_path": "path/to/binding/data",
  "binding_source": "eg, 'chipexo_1234' or cc_1234'",
  "binding_identifier_col": "identifier_column_name",
  "binding_effect_col": "effect_column_name_or_none",
  "binding_pvalue_col": "pvalue_column_name_or_none",
  "rank_by_binding_effect": false,
  "expression_data_path": "path/to/expression/data",
  "binding_source": "eg, 'mcisaac_1234' or kemmeren_1234'",
  "expression_identifier_col": "identifier_column_name",
  "expression_effect_col": "effect_column_name_or_none",
  "expression_effect_thres": 0.0,
  "expression_pvalue_col": "pvalue_column_name_or_none",
  "expression_pvalue_thres": 0.05,
  "rank_bin_size": 5,
  "normalize": false,
  "output_file": "rank_response.csv",
  "compress": false
}

Definitions¶

Key	Description
`binding_data_path`	[REQUIRED] Path to the binding data file. The `binding_effect_col`, `binding_pval_col`, and ‘gene_id’ are required.
`binding_source`	[REQUIRED] A description of where the binding data comes from. This might just be the data source, eg ‘chipexo’, or it might be a identifier, eg callingcards_17 or cc_17
`binding_identifier_col`	[REQUIRED] Name of the feature identifier column in the binding data.
`binding_effect_col`	[REQUIRED] Name of the effect column in the binding data. Set to `none` if an effect column does not exist.
`binding_pvalue_col`	[REQUIRED] Name of the pvalue column in the binding data. Set to `none` if a pvalue column does not exist.
`rank_by_binding_effect`	`true` or `false`. Defaults to `false` if not provided. Set to `true` to rank by the binding effect column.
`expression_data_path`	[REQUIRED] Path to the expression data file.
`expression_source`	[REQUIRED] A description of where the expressin data comes from. This might just be the data source, eg ‘mcisaac’, or it might be a identifier, eg mcisaac_17
`expression_identifier_col`	[REQUIRED] Name of the feature identifier column in the expression data.
`expression_effect_col`	[REQUIRED] Name of the effect column in the gene expression data. Set to `none` if an effect column does not exist.
`expression_effect_thres`	[REQUIRED] Threshold for effect expression. Set to `none` if an effect column does not exist
`expression_pvalue_col`	[REQUIRED] Name of the pvalue column in the gene expression data. Set to `none` if a pvalue column does not exist.
`expression_pvalue_thres`	[REQUIRED] Threshold for pvalue of effect expression. Set to `none` if a pvalue column does not exist
`rank_bin_size`	Defaults to 5 if not provided. Bin size for rank response.
`normalize`	This is not currently implemented – it is a placeholder for future development when list input of binding/effect data is supported. `true` or `false`.
`output_file`	Path to the output file. Defaults to `rank_response.csv`.
`compress`	Set this flag to gzip the output file. Defaults to `false`