|
|
Original Paper
|
|
|||
Acta Biochim Biophys
Sin 2007, 39: 317-325 |
||||
doi:10.1111/j.1745-7270.2007.00282.x |
Comparative Analysis of
Two-component Signal Transduction System in Two Streptomycete Genomes
Wu WEI1,3#, Weihua WANG2#, Zhiwei CAO3#, Hong YU3, Xiaojing WANG1, Jing ZHAO3, Hao TAN3, Hao XU3, Weihong JIANG2*, and Yixue LI1,3*
1
Bioinformation Center, Key Lab of Systems Biology, Shanghai Institutes for
Biological Sciences, Chinese Academy of Sciences, Graduate School of the
Chinese Academy of Sciences, Shanghai 200031, China;
2 Laboratory of Molecular
Microbiology, Institute of Plant Physiology and Ecology, Shanghai Institutes
for Biological Sciences, Chinese Academy of Sciences, Graduate School of the
Chinese Academy of Sciences, Shanghai 200032, China;
3 Shanghai
Center for Bioinformation Technology, Shanghai 200235, China
Received: November
29, 2006������
Accepted: February
27, 2007
This study was
supported by the grants from the National Natural Science Foundation of China (30470029,
30500107), Key Program of Basic Research of Shanghai (No. 06PJ14072), and the
Major State Basic Research Development Program of China (2004CB720103,
2006CB910705, and 2007CB707803)
# These authors
contributed equally to this work
*Corresponding
authors:
Yixue LI: Tel,
86-21-64363311; Fax, 86-21-64838882; E-mail, [email protected]
Weihong JIANG: Tel,
86-21-54924172; Fax, 86-21-54924015; E-mail, [email protected]
Abstract������� Species of the genus Streptomyces are
major bacteria responsible for producing most natural antibiotics. Streptomyces
coelicolor A3(2) and Streptomyces avermitilis were sequenced in
2002 and 2003, respectively. Two-component signal transduction systems (TCSs),
consisting of a histidine sensor kinase (SK) and a cognate response regulator
(RR), form the most common mechanism of transmembrane signal transduction in
prokaryotes. TCSs in S. coelicolor A3(2) have been analyzed in detail.
Here, we identify and classify the SK and RR of S. avermitilis and
compare the TCSs with those of S. coelicolor A3(2) by computational
approaches. Phylogenetic analysis of the cognate SK-RR pairs of the two species
indicated that the cognate SK-RR pairs fall into four classes according to the
distribution of their orthologs in other organisms. In addition to the cognate
SK-RR pairs, some potential partners of non-cognate SK-RR were found, including
those of unpaired SK and orphan RR and the cross-talk between different
components in either strain. Our study provides new clues for further
exploration of the molecular mechanism for regulation of industrially important
streptomycetes.
Key words������� Streptomyces; two-component system; cross-talk;
phylogenetic analysis
Two-component signal transduction systems (TCSs), consisting of a histidine (His) sensor kinase (SK) and a cognate response regulator (RR), serve as a basic stimulus-response coupling mechanism to allow organisms to sense and respond to changes in many different environmental conditions in prokaryotes [1]. They are widespread not only in almost all prokaryotes and many archaea, but also in some eukaryotes, such as fungi and plants, in which they play an important role in light and hormone signaling.
Most of the SKs are membrane-associated His kinases. Extracellular stimuli are sensed by the periplasmic domain of the SK, and serve to modulate the activities of the SK. The SK catalyzes ATP-dependent autophosphorylation of a specific His residue located in its dimerization domain. The phosphoryl group subsequently transfers from the phosphohistidine of the SK to a specific aspartate (Asp) residue within the conserved regulatory domain of the RR. Phosphorylation of the regulatory domain activates a downstream output domain that elicits the specific cellular response [2].
Streptomyces is a genus of Gram-positive bacteria. Unlike normal bacteria, streptomycetes have a complex development life cycle such as mycelial growth and spore formation. To adapt the particularly complex and variable environment, streptomycetes possess a broad range of metabolic processes and biotransformations [3,4]. The most interesting property of streptomycetes is their ability to produce most natural antibiotics used in human and veterinary medicine and agriculture. Therefore it is quite essential to understand the biological process of remarkable morphological differentiation and antibiotic production in streptomycetes [5]. Over 20 different pleiotropic genes influence antibiotic production in Streptomyces coelicolor A3(2), three of which are TCSs, indicating an important role for protein phosphorylation and phosphory�lation cascades in the regulation of antibiotic production [6].
Two complete genomic sequences of the genus Streptomyces are now available. S. coelicolor A3(2), the best-known representative of streptomycetes, was sequenced in 2002 and has an 8.7 Mbp linear chromosome containing 7825 protein-encoding sequences [3,4]. The second streptomycete genome, S. avermitilis, was published in 2003 [4]. The linear chromosome of this genome is just over 9 Mbp, which is larger than that of S. coelicolor A3(2) but contains� fewer open reading frames (7574 instead of 7825). Comparative analysis of the two streptomycete genomes revealed that there is a common highly conserved 6.5 Mbp region with respect to gene order and content [4]. SKs and RRs of S. coelicolor A3(2) have been analyzed in detail [7]. However, to date, a detailed analysis of TCSs of S. avermitilis has not been reported.
In this study, we attempt to conduct a comparative analysis that will constitute a basis for further exploration of the signal transduction systems of streptomycetes.
Materials and Methods
Identification and
classification of SKs and RRs of S. avermitilis
The genome sequences of S. avermitilis (http://avermitilis.ls.kitasato-u.ac.jp/) were searched against the Interpro database [8] using InterproScan on a local workstation. The SKs were identified by visual inspection of all the proteins that contain the ATPase domain [7,10]. According to the alignments of the 16 amino acids around the conserved histidine for each of the five groups of SKs in Bacillus subtilis [11], we made five hidden Markov models (HMMs) using the hmmbuild program of the HMMER package Version 2.3.2 [12] (http://hmmer.janelia.org/). The identified SKs were assigned to five groups by searching the five HMMs using the hmmpfam program of the HMMER suite (E<10-5). The transmembrane (TM) domains of each SK were acquired using TopPred II (http://bioweb.pasteur.fr/seqanal/interfaces/toppred.html) [13]. The two best matches for each SK were assumed to be the true TM domains, and all the residues between the end of the first and the start of the second TM domain were taken as the sensor domain. Similar to SKs, RRs were identified by checking all the proteins that contain the CheY-like domain. Alignments of the C-terminal output domains of S. avermitilis RRs with those of Escherichia coli RRs revealed the five different groups of RRs [11]. All the functional domains of SKs and RRs were obtained from the InterproScan results.
Comparison of TCSs in S.
coelicolor A3(2) and S. avermitilis
All the orthologs of the TCSs were retrived through KEGG API (http://www.genome.jp/kegg/soap/;
SOAP interface to KEGG) from KEGG SSDB using best-best relations
(Smith-Waterman score>100). The orthologs of SKs in the two streptomycete
genomes with less than 50% amino acid sequence identity in their sensor domains
were ignored.
Sequence alignment and
phylogenetic analysis
In addition to trans-phosphorylation between cognate SK-RR pairs, cross-talk in trans-phosphorylation between non-adjacent SK-RR pairs was also reported [14]. To explore those potential cross-talks in S. coelicolor A3(2) and S. avermitilis, we inferred functional coupling between SKs and RRs based on conservation of SK-RR pairs between genomes, which is employed as a classical computational method to predicting functionally coupled genes [15]. If orthologs of non-cognate SK and RR in S. coelicolor A3(2) and S. avermitilis are adjacent in some other organisms, we assumed that the potential cross-talk in signal transduction might take place between them or they are a potential pair (Fig. 1). We took those non-cognate SK and RR pairs with more than 10 support ortholog pairs in other organisms as the potential pairs or the potential cross-talks of TCSs in either streptomycete genomes. The multiple alignments were obtained by aligning with the program CLUSTAL W [16] and phylogenetic trees were constructed using PHYLIP based on the neighbor-joining algorithm [17] and the bootstrap value was 1000.
Results
Identification of TCSs of S.
avermitilis and comparison with S. coelicolor A3(2)
Sixty-seven SKs were obtained after the visual inspection of S. avermitilis proteins that contain both the ATPase domain and the conserved His domain. Similarly, 68 RRs were found by checking the CheY-like domain and the RR output domain. Of these 67 SKs, 53 are adjacent to RRs (one pair is assumed), and 14 are unpaired. Besides the 53 paired RRs, there are 15 orphan RRs as well (Table 1).
The sensor domains of SKs vary greatly to sense diverse environment stimuli, and the ATP domains, conserved in all SKs, do little to differentiate them . The region around the conserved His, the site of autophosphorylation, proved to be more informative for classification of SKs. Alignment with the region around the His that becomes phosphorylated revealed that the SKs of S. avermitilis fell into the five main groups (I, II, IIIa, IIIb and IV), as defined in B. subtilis [11]. Similar to S. coelicolor A3(2), most of the SKs of the S. avermitilis fell into group II and IIIa (32 and 30, respectively), with only one in group I, one in group IIIb and five in group IV. The RRs were classified to NarL, OmpR, DBD, LytTR and wHtH groups by the relatedness of their output domains (Table 1). The largest number of RRs fell into the NarL group, which are all contain the bacterial regulatory protein LuxR domain (InterPro IPR000792). It was found that the second largest number of RRs containing the C-terminal transcriptional regulatory domain (Trans_reg_C; IPR001867) was related to the OmpR group. LuxR domain and the C-terminal transcriptional regulatory domain are subfamilies of winged helix repressor DNA-binding domain (IPR011991), which contains the winged helix-turn-helix DNA-binding motif. It is notable that SAV6219 contains the ANTAR domain (IPR005561), which is an RNA-binding domain found in bacterial transcription antitermination regulatory proteins [18]. This domain has been detected in various RRs of TCSs and also some one-component sensory regulators from a variety of bacteria. Most activated RRs interact with DNA to activate or repress the transcription of a series of genes, however, ANTAR-containing RR might interact with RNA to resist the termination of special gene transcription.
It is interesting that, as in B. subtilis and S. coelicolor A3(2), all of the SKs in group II are linked with RRs classified as NarL family, whereas all of the group IIIa SKs are paired with RRs belonging to the OmpR family without exception. Two SKs of group IV are paired with RRs containing a typical winged helix-turn-helix domain, while� the SKs of group I and IIIb are unpaired. The conclusion for the conserved relationship is that the catalytic domain of the SKs and both domains of the RRs might have evolved as a unit from a common ancestor [11]. Consistent with this conclusion, the gene orders of SK and RR in the transcription units are preserved within different classes (Table 1).
Most SKs contain a variety of extracellular, intracellular and/or transmembrane functional domains to respond to specific environmental stimuli. As a result, there is little sequence similarity in the N-terminal domains of the SKs. The membrane topology predictions of the SKs indicate that over half of the SKs have three or more TM domains. Three soluble cytoplasmic proteins were predicted for those SKs that contain no TM domain. They might be activated by another SK in the transduction pathway, as for E. coli CheA [19]. The region between the end of the first and the start of the second TM domain was taken as the sensor domain (Table 1). The size of the sensor domains ranges from 3 to 275 amino acids. Eight SKs containing sensor domains of less than 20 amino acids were predicted to belong to a new subfamily of SKs, which is almost entirely buried in the cytoplasmic membrane and frequently linked to ABC transporters. SKs of this new subfamily were speculated to sense changes in membrane structure or topology [20].
PAS and GAF domains, as cytosolic sensing modules, have been found in a large number of SKs. Three SKs that contain PAS domain (IPR000014) were found in the S. avermitilis genome. PAS domain proteins function to detect some signals, such as oxygen, redox potential and light, by binding flavins, haems, chromophores or some other cofactors [21]. The GAF domain (IPR003018) is found in six SKs of S. avermitilis. GAF domains appear to act as binding sites for small ligands that induce the autophosphorylation of the SK and subsequent signal transduction to activate specific gene transcription [22]. One SK (SAV6889) was detected that contains the nitrate and nitrite-sensing� domain (IPR010910), which responds to changes in nitrate and nitrite concentrations.
Phylogenetic analysis of the
TCSs in S. avermitilis and S. coelicolor A3(2) genomes
All the potential pairs of the TCSs in two streptomycete genomes
were analyzed and the numbers of all supported pairs of each potential pair
were counted. The organism distribution of each cognate SK-RR pair and the
support pairs consisting of their orthologs showed that the paired TCSs fell
into four classes: present in most bacteria; present in most actinobacteria;
specific to streptomycetes; and specific to either of the two streptomycete
strains.
KdpD-KdpE of S. coelicolor A3(2), have orthologs in S. avermitilis
and 45 other bacteria. KdpD and KdpE, which regulate the expression of the high
affinity K+ transport system most notably under K+ limiting conditions [23], have been extensively studied in E.
coli and appear to be ubiquitous in most bacteria. Independent phylogenetic
analyses were carried out using amino acid sequences of the regions assigned
for classification of KdpD, KdpE and their orthologs (Fig. 2). The
similarity of the two phylogenetic trees could imply the inherent functional
connection between the catalytic domain of the SKs and both the regulatory and
output domains of the RRs.
In the S. coelicolor A3(2) genome, 12 pairs of TCSs are present in most actinobacteria, and 42 TCS pairs are specific to streptomycetes. CutRS, which might play roles in the production of actinorhodin [24], and ChiRS, which is related to chitinase production [25], might be specific to streptomycete strains. The TCS pairs specific in streptomycetes exceed more than those present in most bacteria or in the actinobacteria group, suggesting that this genus might be well equipped to adapt to a wide range of environmental stimuli and stresses, and to regulate complex multicellular development, a broad range of metabolic processes and biotransformation.
Phylogenetic analysis using all orthologs of available organisms
revealed that 26 pairs of TCSs are specific to S. coelicolor A3(2). As
mentioned above, the S. coelicolor A3(2) genome contains
31 paired TCSs that are specific compared to the S. avermitilis genome. Of
these paired TCSs, five have orthologs in some other organisms but not in S.
avermitilis. While in the 17 paired TCSs specific to S. avermitilis,
only one pair has orthologs in other bacteria.
Discussion
The basic TCS elements can be combined to produce a His-Asp-His-Asp phosphorelay. Central to this phos�phorelay pathway is a hybrid-type SK that contains both an SK core and an RR receiver domain in a single protein [26]. Two hybrid-type SKs and their orthologs (SAV1085 with SCO7327, SAV2512 with SCO5748) were identified in S. avermitilis and S. coelicolor A3(2). The third hybrid-type SK in S. avermitilis, SAV 5564, has no ortholog in S. coelicolor A3(2). The complexity of phosphorelay systems permits the integration of multiple check points and regulatory steps into the pathway [27].
In addition to the cognate TCS pairs we have discussed in detail, some potential pairs that are not adjacent in the genome were found by the ortholog analysis. The SK SAV1990 and RR SAV1988 are not back-to-back in the S. avermitilis genome. However, our analysis suggested that they are cognate, as their orthologs (SCO6253 and SCO6254) are adjacent and they accord with the rule that all SKs falling into group II are linked with an RR classified to the NarL family (Table 1). Orthologs of S. avermitilis TCSs, SAV7118 (SK) and SAV7115 (RR) are adjacent in 47 organisms (Table 2), and orthologs of SAV3017 (SK) and SAV6219 (RR) are adjacent in 10 organisms. These four TCSs had been supposed to be unpaired SKs or orphan RRs in S. avermitilis because they did not have an adjacent pair in the genome, but the cases in other organisms indicated that they might be two paired TCSs.
Microarray analysis for the TCS mutants of E. coli has represented that TCSs functionally interact with each other, at least for certain combinations, to expand the signal transduction network so as to allow some genes to respond to a wide range of environmental stimuli [28]. Trans-phosphorylation in vitro was detected between non-cognate SK-RR pairs in E. coli at a rate of approximately 3% [14], raising the possibility that the cross-talk in signal transduction takes place between non-cognate SKs and RRs. In this study, some non-cognate SK-RR pairs were found in both the S. avermitilis and S. coelicolor A3(2) genomes that have cognate ortholog SK-RR pairs in other organisms (Table 2), suggesting that cross-talk might take place between them. TCSs, such as SCO7534, SCO3741, SCO1136 and SCO1801 in S. coelicolor A3(2), and SAV2430, SAV2971, SAV4416 and SAV4047 in S. avermitilis might be inclined to take part in the cross-talk. The streptomycete strain-specific TCSs have insufficient support pairs, so that the cross-talk referencing to these TCSs might be omitted. While our results accord with the corollary that if cross-talk between SK-RR pairs is of regulatory significance, it is likely to occur only within a group [11].
The identification and classification of the TCSs in the S. avermitilis genome provide the foundation to understand the signal transduction system of the strain. Approximately 80% of the commercially available antibiotics are produced by the streptomycetes, therefore comparison analysis of the TCSs of the two streptomycete strains can improve our understanding of the two-component systems of this organism and also provide insight into the molecular mechanisms of regulation in industrially important strains of streptomycetes.
Acknowledgements
We are grateful to Drs. Govind CHANDRA and Matthew I. HUTCHINGS from the John Innes Centre (Nowich, UK), for their useful discussion and valuable advice.
References
1�� Stock AM, Robinson VL, Goudreau PN.
Two-component signal transduction. Annu Rev Biochem 2000, 69: 183-215
2�� West AH, Stock AM. Histidine kinases and
response regulator proteins in two-component signaling systems. Trends Biochem
Sci 2001, 26: 369-376
3�� Bentley SD, Chater KF, Cerdeno-Tarraga AM,
Challis GL, Thomson NR, James KD, Harris DE et al. Complete genome sequence
of the model actinomycete Streptomyces coelicolor A3(2). Nature 2002,
417: 141-147
4�� Ikeda H, Ishikawa J, Hanamoto A, Shinose M,
Kikuchi H, Shiba T, Sakaki Y et al. Complete genome sequence and
comparative analysis of the industrial microorganism Streptomyces
avermitilis. Nat Biotechnol 2003, 21: 526-531
5�� Chater KF. Genetics of differentiation in
Streptomyces. Annu Rev Microbiol 1993, 47: 685-713
6�� Kieser T, Bibb MJ, Buttner MJ, Chater KF,
Hopwood DA. Practical Streptomyces Genetics. 2nd edn. Norwich: The John Innes
Foundation 2000
7�� Hutchings MI, Hoskisson PA, Chandra G,
Buttner MJ. Sensing and responding to diverse extracellular signals? Analysis
of the sensor kinases and response regulators of Streptomyces coelicolor
A3(2). Microbiology 2004, 150: 2795-2806
8�� Mulder NJ, Apweiler R, Attwood TK, Bairoch A,
Bateman A, Binns D, Bradley P et al. InterPro, progress and status in
2005. Nucleic Acids Res 2005, 33: D201-D205
9�� Zdobnov EM, Apweiler R. InterProScan���an
integration platform for the signature-recognition methods in InterPro.
Bioinformatics 2001, 17: 847-848
10� Murzin AG, Brenner SE, Hubbard T, Chothia C.
SCOP: A structural classification of proteins database for the investigation of
sequences and structures. J Mol Biol 1995, 247: 536-540
11� Fabret C, Feher VA, Hoch JA. Two-component
signal transduction in Bacillus subtilis: How one organism sees its
world. J Bacteriol 1999, 181: 1975-1983
12� Eddy SR. Profile hidden Markov models.
Bioinformatics 1998, 14: 755-763
13� Claros MG, Vonheijne G. Toppred II: An
improved software for membrane-protein structure predictions. Comput Appl
Biosci 1994, 10: 685-686
14� Yamamoto K, Hirao K, Oshima T, Aiba H, Utsumi
R, Ishihama A. Functional characterization in vitro of all two-component
signal transduction systems from Escherichia coli. J Biol Chem 2005,
280: 1448�1456
15� Overbeek R, Fonstein M, D�Souza M, Pusch GD,
Maltsev N. The use of gene clusters to infer functional coupling. Proc Natl
Acad Sci USA 1999, 96: 2896-2901
16� Chenna R, Sugawara H, Koike T, Lopez R, Gibson
TJ, Higgins DG, Thompson JD. Multiple sequence alignment with the Clustal
series of programs. Nucleic Acids Res 2003, 31: 3497-3500
17� Felsenstein J. PHYLIP: Phylogeny inference
package (Version 3.2). Cladistics 1989, 5: 164-166
18� Shu CYJ, Zhulin IB. ANTAR: An RNA-binding
domain in transcription antitermination regulatory proteins. Trends Biochem Sci
2002, 27: 3-5
19� Levit M, Liu Y, Surette M, Stock J. Active
site interference and asymmetric activation in the chemotaxis protein histidine
kinase CheA. J Biol Chem 1996, 271: 32057-32063
20� Mascher T, Margulis NG, Wang T, Ye RW, Helmann
JD. Cell wall stress responses in Bacillus subtilis: The regulatory
network of the bacitracin stimulon. Mol Microbiol 2003, 50: 1591-1604
21� Taylor BL, Zhulin IB. PAS domains: Internal
sensors of oxygen, redox potential, and light. Microbiol Mol Biol Rev 1999, 63:
479-506
22� Aravind L, Ponting CP. The GAF domain: An
evolutionary link between diverse phototransducing proteins. Trends Biochem Sci
1997, 22: 458-459
23� Wood JM. Osmosensing by bacteria: Signals and
membrane-based sensors. Microbiol Mol Biol Rev 1999, 63: 230-262
24� Chang HM, Chen MY, Shieh YT, Bibb MJ, Chen CW.
The cutRS signal transduction system of Streptomyces lividans represses
the biosynthesis of the polyketide antibiotic actinorhodin. Mol Microbiol 1996,
21: 1075-1085
25� Tsujibo H, Hatano N, Okamoto T, Endo H,
Miyamoto K, Inamori Y. Synthesis of chitinase in Streptomyces
thermoviolaceus is regulated by a two-component sensor-regulator system.
FEMS Microbiol Lett 1999, 181: 83-90
26� Zhang WW, Shi L. Distribution and evolution of
multiple-step phosphorelay in prokaryotes: Lateral domain recruitment involved
in the formation of hybrid-type histidine kinases. Microbiology 2005, 151: 2159-2173
27� Robinson VL, Buckler DR, Stock AM. A tale of
two components: A novel kinase and a regulatory switch. Nat Struct Biol 2000,
7: 626-633
28� Oshima T, Aiba H, Masuda Y, Kanaya S, Sugiura
M, Wanner BL, Mori H et al. Transcriptome analysis of all two-component
regulatory system mutants of Escherichia coli K-12. Mol Microbiol 2002,
46: 281-291