Factors
Affecting Codon Usage in Yersinia pestis
HOU Zhuo-Cheng, YANG Ning*
(
College of Animal Science and Technology, China Agricultural University,
Beijing 100094, China )
Abstract The
complete genome of Yersinia pestis which was the causative agent of the
systemic invasive infectious disease classically referred as plague, had been
recently sequenced. In order to have a further insight into the synonymous
codon usage evolution, factors shaping synonymous codon usage pattern of
Yersinia pestis were analyzed in this paper. The coding sequences larger than
or equal to 300 bp were used in codon usage analysis. Though “G”+”C”
content in Y. pestis genome was slightly lower (47.64%), the highly
expressed genes tended to use “C” or “G” at synonymous
sites compared with lowly expressed genes. Conversely, lowly expressed genes
tended to prefer “A” or “T” at synonymous positions. Gene
expression level was strongly correlated with the first axis of the
correspondence analysis (COA) (R=0.63, P<0.0001). By the analyses of the
codon usage pattern of highly and lowly expressed genes, it was confirmed that
gene expression level was partially responsible for the codon usage bias.
GC-skew analysis showed that codon usage suffered replication-transcriptional
selection. Codon adaptation index (CAI), frequency of "C"+"G"
at the synonymous third position of codon (GC3s) and the effective number of
codons (Nc) values showed some differences among different gene length groups. "G"+"C"
content of genes was strongly correlated with the first axis of the COA (R=0.72,
P<0.0001). It could be concluded that gene expressivity,
replication-transcriptional selection, gene length and gene composition
constraints were the main affecting factors of codon usage variation in Y.
pestis.
Key words codon
usage; correspondence analysis; gene expression level; coding sequence length;
Yersinia pestis
*Corresponding author: Tel, 86-10-628927141; Fax, 86-10-62891351; e-mail, [email protected]
