Synonymous and non-synonymous substitutions
Calculate dN/dS
Warning
This is a novel function and has not yet undergone testing by external users. Please report any bugs or issues to the PanTools team so we can improve it.
This function allows you to calculate synonymous (dS) and non-synonymous (dN) substitutions between homologous and syntenic genes. Nonsynonymous mutations result in protein changes and frequently under selection. Synonymous substitutions do not lead to amino acid changes and are (in general), not under selection. The dN/dS ratio is a metric indicating the evolutionary rate of gene sequences and is commonly used as indicator of selection pressure.
Homology groups are aligned. Synteny block information in the pangenome is used to identify syntenic gene pair to only extract these values from the output.
- Required software
- Parameters
<databaseDirectory>
Path to the database root directory.
- Options
--include
/-i
Only include a selection of genomes.
--exclude
/-e
Exclude a selection of genomes
-—selection-file
Text file with rules to use a specific set of genomes and sequences.
--threads
/-t
Number of parallel threads.
--syntelogs
Calculate substitutions between syntenic genes (default: all homologous genes).
--phasing
Analyze phased genomes.
--scoring-matrix
The scoring matrix to use (default: BLOSUM62).
--homology-file
/-H
Text file with homology_group node identifiers.
--allow-zeros
Allow zeros for dN or dS values.
- Example commands
$ pantools calculate_dn_ds tomato_DB $ pantools calculate_dn_ds tomato_DB --include 1,2 $ Rscript tomato_DB/dnds/rscript/1_2/plot_dnds.R
- Output
Output scripts are available in dn_ds/scripts:
homologs_plot_dnds.R and homologs_plot_log10_dnds.R, R scripts to visualize dN/dS substitutions.