TY - JOUR AU1 - Worobey, Michael AU2 - Holmes, Edward C. AB - Abstract Analysis of 33 GB virus C/hepatitis G virus (GBV-C/HGV) full or nearly full genome sequences revealed several putative inter- and intrasubtype recombinants. The breakpoints of the recombinant regions were mapped using a maximum-likelihood method, and the statistical significance for each region was tested using Monte Carlo simulation. The results were highly significant and provided evidence for the existence of complex mosaic genomes showing as many as nine recombination events, with breakpoints in the 5′ UTR and in all of the coding regions except the short NS4b gene. Recombination was confirmed by separate phylogenetic analysis of the various recombinant regions and by Sawyer's runs test. Taken together, these findings demonstrate for the first time that recombination is common in natural populations of GBV-C and that it takes place both within and between subtypes. The wide-ranging implications of such nonclonal history for reconstructing the spread and timescale of GBV-C evolution are discussed. Introduction Recently, a new virus belonging to the family Flaviviridae was independently isolated by two different groups and designated GB virus C (GBV-C) (Simons et al. 1995 ) or hepatitis G virus (HGV) (Linnen et al. 1996 ). Despite its original standing as a putative agent of hepatitis, GBV-C is unique among Flaviviridae infecting human beings (others include hepatitis C virus, dengue virus, and West Nile virus) in that it does not appear to be associated with disease (Fan et al. 1999 ). Infection is especially widespread in populations at risk for blood-borne transmission (20% in patients who have received multiple transfusions, 30% in intravenous drug users) but is common even in healthy populations worldwide (1%–2% in blood donors in the United States and Japan) (Naito, Win, and Abe 1999 ). Phylogenetic analyses based mostly on 5′ UTR sequences, although more recent analyses have used full-genome sequences of isolates from around the world, have identified four genetic subtypes of GBV-C (type 1–4) with highly regional distributions (Muerhoff et al. 1997 ; Saito et al. 1999 ; Naito, Win, and Abe 1999 ; Naito, Hayashi, and Abe 2000 ). Type 1 is common in Africa, types 3 and 4 are common in Asia, and type 2 is common in Europe and the United States. Moreover, a virus closely related to GBV-C circulates in chimpanzees (Adams et al. 1998 ), while more distant homologs are present in New World monkeys (Bukh and Apgar 1997 ). This phylogeographic pattern has led to suggestions that GB viruses have cospeciated with their primate hosts (Charrel, De Micco, and de Lamballerie 1999 ) and has been interpreted by some as evidence for an African origin of GBV-C that predates human migration out of that continent around 100,000 years ago (e.g., Tanaka et al. 1998 ; Suzuki et al. 1999 ). Under this hypothesis, current phylogenetic relationships among GBV-C isolates may reflect ancient human population movements. In addition to providing a potentially important viral marker for human population history and a model system for the study of the evolution of nonpathogenic infectious agents, GBV-C is puzzling in a variety of ways. For instance, the apparently long timescale of GBV-C evolution in humans seems difficult to reconcile with the high evolutionary rates expected of RNA viruses, which would predict only residual sequence similarities after 100,000 years of evolution (Simmonds and Smith [1999] , but see Nakao et al. [1997] and Suzuki et al. [1999] for dramatically different substitution rate estimates for GBV-C inferred from the same data). An apparent conflict between high rates of nucleotide substitution and clear sequence similarities between viruses which may have cospeciated with their hosts has also been raised with respect to the primate lentiviruses (Sharp et al. 2000). Furthermore, phylogenetic groupings among GBV-C isolates are often inconsistent across different regions of the genome, unlike related viruses such as hepatitis C virus (HCV) (Muerhoff et al. 1997 ; Smith et al. 2000 ). This has led to extensive efforts to find small regions of the GBV-C genome that will faithfully replicate the phylogenetic trees produced using longer sequences and hence will aid typing schemes (e.g., Muerhoff et al. 1997 ; Smith et al. 2000 ). One fundamental evolutionary process that may have been overlooked in previous studies of GBV-C, and which could play a key role in determining its distinctive patterns of genetic diversity, is recombination. Several observations, including that (1) GBV-C infection is common, (2) infection is often persistent, and (3) homologous recombination is known to be a property of Flaviviridae viruses (Holmes, Worobey, and Rambaut 1999 ; Worobey, Rambaut, and Holmes 1999 ), led us to wonder what role, if any, recombination plays in the evolution of GBV-C. Although many RNA viruses are assumed to be clonal, their population structure could, in fact, range from truly clonal to effectively panmictic, depending on the extent of recombination (Worobey and Holmes 1999 ). Establishing where a given virus falls along this continuum is not a trivial task, but it is important if we are to trust the results of phylogenetic analyses. If recombination has been sufficiently common to make a phylogenetic tree an inappropriate representation of evolutionary history, then inferences based on it must be treated with caution. In this study, we collected published gene sequences of full and nearly full genomes of GBV-C and employed methods for detecting putative recombinants, for locating possible breakpoints, and for evaluating these results for statistical significance to test the hypothesis that GBV-C represents a clonal group of viruses. Our results indicate that GBV-C is not clonal. Materials and Methods The positive-sense RNA genome of GBV-C, about 9,400 nt in length, encodes a single polyprotein of approximately 2,900 amino acids comprising eight genes, in the order E1, E2, NS2, NS3, NS4a, NS4b, NS5a, and NS5b (Erker et al. 1996 ). We collected 33 published full or nearly full genome sequences of GBV-C from GenBank with accession numbers and sequence names as follows: U36380, GBC-WAF; AB013500, GA128-GHA; AB003291, CG12LC-JAP; AF031827, T55875-USA; AB003289, CGO1BD-JAP; D87255, Iw-JAP; D90600, T110-JAP; U63715, EA-EAF; AF104403, AUQ-FRA; U45966, R10291-USA; U44402, PNF2161-USA; AF121950, Iowan-USA; AB013501, BL230-BOL; D87263, GSI93-JAP; AB008342, IM71-JAP; AB008335, K3732-JAP; AB003288, BG1HC-JAP; AB003290, CG07BD-JAP; AB003293, G13BD-JAP; D90601, T230-JAP; D87710, K1741-JAP; AF006500, GD-CHI; U94695, CN-CHI; U75356, C964-CHI; D87711, K1789-JAP; D87713, K2141-JAP; D87712-JAP, K1916-JAP; D87709, K1737-JAP; D87708, K606-JAP; D87715, K1775-JAP; AB003292, GO5BD-JAP; AB021287, MY14-MYA; AB018667, VT48-VIE. (The last three letters in each name indicate geographic origin of the isolate: WAF = West Africa; GHA = Ghana; JAP = Japan; USA = United States; EAF = East Africa; FRA = France; BOL = Bolivia; CHI = China; MYA = Myanmar; and VIE = Vietnam.) The sequences were aligned manually, and the alignment may be obtained from the authors on request. Initially, we used exploratory tree analysis to look for evidence of conflicting phylogenetic signal within particular sequences (see Gao et al. [1998] for a similar approach). First, we reconstructed neighbor-joining (NJ) trees on 600-nt windows of the original alignment shifted in 300-nt increments, as well as for the full-length data set, with distances estimated using the HKY85 (Hasegawa, Kishino, and Yano 1985 ) model of DNA substitution. Methods correcting for site-specific rate heterogeneity were not employed at this point, since they would have greatly increased the time but not the quality of this analysis, which was intended only to identify sequences that dramatically changed phylogenetic position over the length of their genomes. Such possible mosaic strains, which clustered with distinct groups in different regions of the full alignment, were then analyzed using diversity plots which show the similarity of a particular sequence across its whole length compared with different putative “parents”—the related sequences it most closely resembles in certain regions (e.g., Gao et al. 1998 ; Worobey, Rambaut, and Holmes 1999 ). The putative recombinants recovered using these methods were then subjected to maximum-likelihood (ML) breakpoint estimation using the program LARD (Holmes, Worobey, and Rambaut 1999 ; http://evolve.zoo.ox.ac.uk). LARD finds optimal recombination breakpoints, given a sequence alignment of a putative recombinant and its two parents, by breaking the alignment into two parts and reconstructing a separate ML tree for each. For every possible partition of the alignment, the process is repeated, and the ML scores are combined to produce a “recombination model” score for that set of breakpoints. If recombination has occurred among the sequences, the highest score is expected when the alignment is broken at the actual recombination breakpoint, since the two trees reconstructed in this case best reflect the true phylogenetic history of the separate recombinant regions. Estimating the breakpoints in this manner leads to a natural test of their statistical significance: the recombination model likelihood score can be compared with the likelihood score for the unbroken alignment (i.e., the “no-recombination model”) by a likelihood ratio test and a Monte Carlo approach using sequences simulated without recombination but with the observed levels of site-specific rate heterogeneity (see Holmes, Worobey, and Rambaut 1999 ; Worobey, Rambaut, and Holmes 1999 ). Because more than two putative parents were inferred in some cases, and this method becomes computationally intensive with longer sequences, the alignment was broken into nine overlapping fragments during LARD analysis. Except at the 5′ and 3′ ends, these fragments were 1,200 nt in length and overlapped neighboring fragments by 200 nt at each end. For each region analyzed, 200 simulated sequence alignments (used to generate null distributions for significance testing) were produced using ML parameter values estimated from the observed data (under the HKY85 model of nucleotide substitution, with transition : transversion ratios and shape parameters of the gamma distribution of among-sites rate heterogeneity estimated during tree reconstruction). All trees were produced using PAUP*, version 4 (Swofford 1998 ). The conflicting phylogenetic signal contained within the genomes of putative recombinants (i.e., those cases that showed a much greater improvement than expected by chance even when site-specific rate heterogeneity was accounted for) was then confirmed by bootstrap phylogenetic trees (10,000 replicates using NJ trees with distances estimated using the HKY85 model of DNA substitution). Finally, to complement our phylogenetic approach to analyzing recombination, we applied Sawyer's (1989) runs test. This method, implemented in the package GENECONV (S. Sawyer, Department of Mathematics, Washington University, St. Louis, Mo.) searches for unusually long fragments within an alignment over which a pair of sequences are identical or nearly identical, then assesses the significance of the hypothesis that the similar fragments arose by recombination by using randomly permuted data sets derived from the real alignment. For each fragment, the test calculates two P values: “pairwise” permutation P values represent the proportion of the permuted data having a score for a certain sequence pair greater than or equal to the real score for that pair; “global” permutation P values reflect the proportion of permuted alignments for which some fragment for some pair of sequences has a higher score than a particular fragment. We chose this test because it seemed better suited than the phylogenetic method described above to detecting large amounts of recombination among relatively similar viruses, such as those within a GBV-C subtype. In all cases, the default parameters were used with the following exceptions: indels were skipped, mismatches were tolerated (Gscale = 1), and the thresholds used for global and pairwise P values were set at 0.05 and 0.01, respectively. Results Exploratory phylogenetic tree and diversity analyses revealed several possible recombinant sequences among the 33 in the alignment, including both putative intersubtype and putative intrasubtype recombinants. Possible intrasubtype recombinants were particularly numerous, with nearly every type 3 sequence, for instance, shifting topological position across its genome in a manner suggestive of recombination (data not shown). The tree for the full-length alignment, however, exhibited high bootstrap support for each of the four subtypes and placed the putative intersubtype recombinants decisively within individual subtypes (fig. 1 ). Subsequent analysis of the putative intersubtype sequences using LARD highlighted three, MY14-MYA, BL230-BOL, and C964-CHI, to be complex recombinants, each exhibiting more than one apparent recombination event (table 1 ). Each recombinant region was highly significant, having an observed likelihood ratio higher than any of the 200 likelihood ratios from the Monte Carlo simulated alignments for each region (P < 0.005). Both BL230-BOL and C964-CHI contained sequence from more than two sources, indicating multiple recombination events. The Chinese sequence C964-CHI was particularly impressive, showing no less than nine apparent recombination events and genetic material from at least four different sources covering three different subtypes. Although the impact of recombination on GBV-C remains an important unresolved question (Smith et al. 2000), others have speculated that two of these sequences (C964-CHI and BL230-BOL) might be recombinant based on phylogenetic inconsistencies between their 5′ and 3′ regions (An et al. 1997 ; Konomi et al. 1999 ). A comparison of the mosaic sequences with a map of the GBV-C genome (fig. 2 ) revealed recombination breakpoints within the 5′ UTR, E1, E2, NS2, NS3, NS4a, NS5a, and NS5b regions. Phylogenetic trees with bootstrap support constructed from the various (concatenated) recombinant regions listed in table 1 clearly demonstrated that each of these sequences was a mosaic containing distinct regions with different phylogenetic histories. MY14-MYA (fig. 3a ), a type 4 sequence over most of its genome, also contained regions of type 2 sequence. The 5′ and 3′ ends of BL230-BOL (fig. 3b ) belonged to type 3, while the type 2 region in between also showed evidence of intrasubtype recombination. Trees for C964-CHI (fig. 3c ) revealed regions of particularly high similarity to the type 2 isolate R10291-USA and the type 1 isolate GBC-WAF, again strongly supporting the conclusion that it was an intersubtype recombinant. For two of the intersubtype recombinants, BL230-BOL and MY14-MYA, we were able to obtain information on the strategy of genome sequencing—specifically, the size and location of the individual fragments used to generate the full genome sequences (Kaneko et al. 1998 ; Konomi et al. 1999 ; Naito, Hayashi, and Abe 2000 ). The lack of correspondence between the recombination breakpoints and the fragment boundaries argued strongly against the possibility that these mosaic genomes were the product of sequencing errors involving contaminated or multiply infected samples. Furthermore, the remarkable mosaic genome of C964-CHI shown in figure 2c, with four different sources of sequence and at least 17 distinct regions, was best explained by a history of recombination and not extensive laboratory error. Finally, each intersubtype recombinant was sequenced in a different laboratory and on a different continent than at least one of its parents. This observation, together with the lack of correspondence between recombination breakpoints and sequencing breakpoints, makes it very unlikely that these apparent recombinants represent laboratory contamination. In addition to the intrasubtype recombination evident in BL230-BOL, two type 3 intrasubtype recombinants, K1916-JAP and K1789-JAP, were also characterized by the above methods (fig. 2d and e ), with P < 0.005 in both cases and results confirmed with bootstrap phylogenetic trees (data not shown). However, because these sequences were almost identical in their recombinant regions to otherwise divergent viruses, they were simply the most obvious of many apparent intrasubtype recombinants. The results of the runs test on the full alignment of 33 sequences provided strong support for recombination in GBV-C: 407 fragments had pairwise P values <0.01, and 78 fragments had global P values <0.05. In the absence of recombination, we would expect no global P values <0.05 and very few pairwise P values <0.01 (randomized data yielded only zero global and five pairwise significant fragments). Significantly, the runs test results corresponded closely to those of the phylogenetic approach, in many cases identifying virtually identical sequence fragments as highly significant even though the two methods weighted mismatches slightly differently. Table 2 lists all the fragments with global P values <0.005 for the analysis of all 33 GBV-C sequences. As expected, the runs test also identified many putative intrasubtype recombination events, especially in the well-represented type 3 group. The 17 type 3 sequences analyzed on their own produced 23 global significant and 98 pairwise significant scores (zero global and four pairwise significant scores for randomized data), while the 10 type 2 sequences produced 5 global and 17 pairwise significant scores (zero global and zero pairwise significant scores for randomized data). Moreover, 14 of the 17 type 3 sequences had fragments with pairwise scores higher than any of the 10,000 permuted data sets, and the remaining three had pairwise P values <0.005, suggesting that most, or perhaps all, of them have been affected by recombination. Discussion Our phylogenetic study produced strong evidence for recombination in GBV-C. Figure 3 shows that the methods we used to identify putative recombinants and putative breakpoints identified long, contiguous regions within individual genomes that undoubtedly belonged to different subtypes. MY14-MYA, for instance, is unambiguously a type 4 sequence in some regions and a type 2 sequence in others. This effect was just as clear when only synonymous third (and presumably neutral) sites were examined (data not shown). While some types of conflicting phylogenetic signal can be produced by processes other than recombination, there is no other known way in which such clearly partitioned conflicting mosaic patterns can arise. Taken together, these results demonstrate for the first time that GBV-C is a recombinogenic virus. Moreover, they suggest that recombination may play a large role in shaping its genetic diversity. Several lines of evidence support this conclusion. First, the high number of confirmed mosaics in the data set shows that recombination can occur both within and between subtypes. Next, the results of Sawyer's runs test reflect a great deal of possible recombination, especially within subtypes. Finally, the existence of complex recombinants like C964-CHI and BL230-BOL, which evidently contain sequence originating from several sources, strongly implies a series of recombination events that would be unlikely unless this process were rather commonplace. Failure to take recombination into account in GBV-C data sets may have seriously affected previous studies of the evolutionary history of this virus. For example, Simmonds and Smith (1999) found an excess of invariant synonymous sites in GBV-C coding regions compared with the frequency expected by chance, which they estimated through simulations of sequence evolution. They suggested that this difference reflected large constraints imposed by RNA secondary structure on GBV-C genomic change and concluded that such constraints invalidated the use of the molecular clock for this and perhaps other RNA viruses. However, their simulated data sets, while reflecting observed transition : transversion ratios, base frequencies, and relative frequencies of synonymous and nonsynonymous substitutions, did not incorporate recombination or site-specific rate heterogeneity. Since recombination can dramatically inflate the apparent degree of rate heterogeneity, it is not surprising that this is essentially what their analysis of this recombinant virus revealed. It therefore seems premature to conclude that such results indicate important evolutionary constraints imposed by RNA secondary structure, as opposed to the effects of overlooked recombination. The influence of recombination on patterns of genetic diversity also provides a plausible explanation for why phylogenetic trees reconstructed using small segments of the GBV-C genome do not always mirror those produced using longer sequences. This discrepancy stands in marked contrast to what is seen for hepatitis C virus, an apparently clonal relative, where analysis of a variety of subgenomic regions reproduces the phylogenetic relationships of full genomes (Smith et al. 2000). One explanation proposed for this difference is that many GBV-C sites may have become saturated, thereby obscuring evolutionary relationships (Muerhoff et al. 1997 ). However, since sequence divergence in GBV-C is less than that in HCV, it seems likely that recombination has played at least as important a role here in obscuring phylogenetic relationships. Hence, the desirability of finding small segments of the GBV-C genome that faithfully reproduce the potentially deceptive phylogenetic relationships inferred from complete genomes should be reconsidered. Although it remains a matter of future research to precisely establish the ways in which recombination might bias estimates of population divergence times, it is likely that the process seriously affects the reliability of this virus as a marker for human population history. Previous approaches (e.g., Suzuki et al. 1999 ) that implicitly assumed a clonal history for GBV-C may have underestimated its age by ignoring the homogenizing effects of recombination. However, while correctly accounting for recombination may push back the perceived time depth of the GBV-C phylogeny, it is worth noting that its very ancient pedigree is still only an assumption. Despite the regional distributions of GBV-C genotypes, taken by some as evidence of an “out of Africa” diversification, most subtypes are found in diverse locations, and some areas exhibit the “wrong” subtype for their region. For example, a study of GBV-C in Vietnam and Myanmar (Naito, Win, and Abe 1999 ) revealed type 2 to be the most prevalent in these Southeast Asian countries, while the supposedly “Asian” types were less common. Nevertheless, the fact that some identifiable subtypes of GBV-C have apparently arisen and persisted in the face of recombination is illuminating. Even if some of the recognized subtypes had a recombinant origin, and even if the gaps in the GBV-C phylogeny were eventually filled by strains that fell outside these subtypes, there is clearly some geographically associated population structure within worldwide GBV-C. This implies that recombination has not been pervasive enough to obscure all genealogical information within GBV-C. In other words, some subpopulations of the virus, although they may generally recombine freely within themselves, evidently have been isolated enough from one another in the past to accumulate characteristic differences. Whether increased mixing of human populations in the future may lead to sufficient opportunity for recombination to break down the current population structure is an open question. Many phylogeny-based methodologies can fall afoul of unrecognized recombination, and the implications of this study apply generally to viruses and other organisms that are often analyzed as though they were clonal. For example, Sharp et al. (2000) demonstrated how incorporating estimates of site-specific rate heterogeneity can push back the estimated divergence times on the primate lentivirus tree, but they did not investigate the possibility that recombination could also affect these estimates. Likewise, inferences about demographic history drawn from the shapes of phylogenetic trees (e.g., Pybus, Holmes, and Harvey 1999 ) could also be compromised by recombination. Starlike gene genealogies, often interpreted as signatures of epidemic expansion, may actually reflect populations subject to high levels of recombination. Indeed, recombination, and not demographic history, may underlie the remarkably starlike tree of GBV-C (fig. 1 ). Although more work needs to be done investigating such concerns, the possibility that recombination has biased our reconstruction of population history should not be overlooked. Keith A. Crandall, Reviewing Editor 1 Keywords: recombination GB virus C hepatitis G virus phylogeny maximum likelihood mosaic 2 Address for correspondence and reprints: Michael Worobey, Department of Zoology, University of Oxford, South Parks Road, Oxford OX1 3PS, United Kingdom. E-mail: michael.worobey@zoo.ox.ac.uk. Table 1 Breakpoint Analysis Results for the Three Putative Intersubtype Recombinants View Large Table 2 Results of Sawyer's Runs Test (fragments with global P value <0.005) View Large View largeDownload slide Fig. 1.—Neighbor-joining phylogenetic tree with bootstrap percentages for the 33 full or nearly full genomes analyzed in this study. Branch lengths are drawn to scale. The four described subtypes are supported by high bootstrap values. Three putative intersubtype recombinants, MY14-MYA, BL230-BOL, and C964-CHI, are boxed. All fall with high bootstrap scores within one of the four subtypes, even though large regions of their genomes appear to belong to other GBV-C subtypes View largeDownload slide Fig. 1.—Neighbor-joining phylogenetic tree with bootstrap percentages for the 33 full or nearly full genomes analyzed in this study. Branch lengths are drawn to scale. The four described subtypes are supported by high bootstrap values. Three putative intersubtype recombinants, MY14-MYA, BL230-BOL, and C964-CHI, are boxed. All fall with high bootstrap scores within one of the four subtypes, even though large regions of their genomes appear to belong to other GBV-C subtypes View largeDownload slide Fig. 2.—Schematic diagrams of the GBV-C genome, the three intersubtype recombinant genomes, and two intrasubtype recombinant genomes. The polyprotein-coding region (partial core protein) starts from nucleotide 565, and the start of each gene region is marked (numbers are shown below the genome). The recombinant regions for (a) MY14-MYA, (b) BL230-BOL, and (c) C964-CHI correspond with table 1 and reveal the complex mosaic nature of these sequences. Comparison of these estimates with the GBV-C genome map confirms the presence of recombination breakpoints in all parts of the genome except the relatively small NS4b and 3′ UTR regions. Two type 3 intrasubtype recombinants, (d) K1916 and (e) K1789, are also shown View largeDownload slide Fig. 2.—Schematic diagrams of the GBV-C genome, the three intersubtype recombinant genomes, and two intrasubtype recombinant genomes. The polyprotein-coding region (partial core protein) starts from nucleotide 565, and the start of each gene region is marked (numbers are shown below the genome). The recombinant regions for (a) MY14-MYA, (b) BL230-BOL, and (c) C964-CHI correspond with table 1 and reveal the complex mosaic nature of these sequences. Comparison of these estimates with the GBV-C genome map confirms the presence of recombination breakpoints in all parts of the genome except the relatively small NS4b and 3′ UTR regions. Two type 3 intrasubtype recombinants, (d) K1916 and (e) K1789, are also shown View largeDownload slide Fig. 3.—Neighbor-joining phylogenetic trees with bootstrap percentages for the three intersubtype recombinants. The recombinant sequence for each tree is highlighted with a box. Branch lengths are drawn to scale. Regions corresponding to each parent were concatenated and are shown above each tree. a, MY14-MYA jumps from type 4 to type 2, with 100% bootstrap support for the conflicting regions. b, BL230-BOL moves from type 3 to type 2, being alternately most similar to one, then another type 2 sequence. c, C964-CHI groups with four different parents from three different subtypes, with 100% bootstrap support in each case View largeDownload slide Fig. 3.—Neighbor-joining phylogenetic trees with bootstrap percentages for the three intersubtype recombinants. The recombinant sequence for each tree is highlighted with a box. Branch lengths are drawn to scale. Regions corresponding to each parent were concatenated and are shown above each tree. a, MY14-MYA jumps from type 4 to type 2, with 100% bootstrap support for the conflicting regions. b, BL230-BOL moves from type 3 to type 2, being alternately most similar to one, then another type 2 sequence. c, C964-CHI groups with four different parents from three different subtypes, with 100% bootstrap support in each case We thank Dr. Kenji Abe for his helpful correspondence and Dr. David Robertson for comments on the manuscript. This work was supported by grants from the Rhodes Trust, the Natural Sciences and Engineering Research Council of Canada, the Royal Society, and the Wellcome Trust. literature cited Adams, N. J., L. E. Prescott, L. M. Jarvis, J. C. M. Lewis, M. O. McClure, D. B. Smith, and P. Simmonds. 1998. Detection in chimpanzees of a novel flavivirus related to GB virus/C hepatitis G virus. J. Gen. Virol.   79: 1871–1877. Google Scholar An, P., L. Wei, X. Y. Wu, N. Yuhki, S. J. O'Brien, and C. Winkler. 1997. Evolutionary analysis of the 5′-terminal region of hepatitis G virus isolated from different regions in China. J. Gen. Virol.   78: 2477–2482. Google Scholar Bukh, J., and C. L. Apgar. 1997. Five new or recently discovered (GBV-A) virus species are indigenous to New World monkeys and may constitute a separate genus of the Flaviviridae. Virology 229:429–436. Google Scholar Charrel, R. N., P. De Micco, and X. de Lamballerie. 1999. Phylogenetic analysis of GB viruses A and C: evidence for cospeciation between virus isolates and their primate hosts. J. Gen. Virol.   80: 2329–2335. Google Scholar Erker, J. C., J. N. Simons, A. S. Muerhoff, T. P. Leary, M. L. Chalmers, S. M. Desai, and I. K. Mushahwar. 1996. Molecular cloning and characterization of a GB virus C isolate from a patient with non-A-E hepatitis. J. Gen. Virol.   77: 2713–2720. Google Scholar Fan, X. F., Y. J. Xu, H. Solomon, S. Ramrakhiani, B. A. Neuschwander-Tetri, and A. M. Di Bisceglie. 1999. Is hepatitis G/GB virus-C virus hepatotropic? Detection of hepatitis G/GB virus-C viral RNA in liver and serum. J. Med. Virol.   58: 160–164. Google Scholar Gao, F., D. L. Robertson, C. D. Carruthers et al. (13 co-authors). 1998. A comprehensive panel of near-full-length clones and reference sequences for non-subtype B isolates of human immunodeficiency virus type 1. J. Virol.   72: 5680–5698. Google Scholar Hasegawa, M., H. Kishino, and T. Yano. 1985. Dating the human-ape splitting by a molecular clock of mitochondrial DNA. J. Mol. Evol.   22: 160–174. Google Scholar Holmes, E. C., M. Worobey, and A. Rambaut. 1999. Phylogenetic evidence for recombination in dengue virus. Mol. Biol. Evol.   16: 405–409. Google Scholar Kaneko, T., S. Hayashi, Y. Arakawa, and K. Abe. 1998. Molecular cloning of full-length sequence of hepatitis G virus genome isolated from a Japanese patient with liver disease. Hepatol. Res.   12: 207–216. Google Scholar Konomi, N., C. Miyoshi, C. L. Zerain, T. C. Li, Y. Arakawa, and K. Abe. 1999. Epidemiology of hepatitis B, C, E, and G virus infections and molecular analysis of hepatitis G virus isolates in Bolivia. J. Clin. Microbiol.   37: 3291–3295. Google Scholar Linnen, J., J. Wages, Z. Y. Zhang-Keck et al. (30 co-authors). 1996. Molecular cloning and disease association of hepatitis G virus: a transfusion-transmissible agent. Science 271:505–508. Google Scholar Muerhoff, A. S., D. B. Smith, T. P. Leary, J. C. Erker, S. M. Desai, and I. K. Mushahwar. 1997. Identification of GB virus C variants by phylogenetic analysis of 5′-untranslated and coding region sequences. J. Virol.   71: 6501–6508. Google Scholar Naito, H., S. Hayashi, and K. Abe. 2000. The entire nucleotide sequence of two hepatitis G virus isolates belonging to a novel genotype: Isolation in Myanmar and Vietnam. J. Gen. Virol.   81: 189–194. Google Scholar Naito, H., K. M. Win, and K. Abe. 1999. Identification of a novel genotype of hepatitis G virus in Southeast Asia. J. Clin. Microbiol.   37: 1217–1220. Google Scholar Nakao, H., H. Okamoto, M. Fukuda, F. Tsuda, T. Mitsui, K. Masuko, H. Lizuka, Y. Miyakawa, and M. Mayumi. 1997. Mutation rate of GB virus C hepatitis G virus over the entire genome and in subgenomic regions. Virology 233:43–50. Google Scholar Pybus, O. G., E. C. Holmes, and P. H. Harvey. 1999. The mid-depth method and HIV-1: a practical approach for testing hypotheses of viral epidemic history. Mol. Biol. Evol.   16: 953–959. Google Scholar Saito, T., K. Ishikawa, M. Osei-Kwasi et al. (13 co-authors). 1999. Prevalence of hepatitis G virus and characterization of viral genome in Ghana. Hepatol. Res.   13: 221–231. Google Scholar Sawyer, S. 1989. Statistical tests for detecting gene conversion. Mol. Biol. Evol.   6: 526–538. Google Scholar Sharp, P. M., E. Bailes, F. Gao, B. E. Beer, V. M. Hirsch, and B. H. Hahn. 2000. Origins and evolution of AIDS viruses: estimating the time-scale. Biochem. Soc. Trans.   28: 275–282. Google Scholar Simmonds, P., and D. B. Smith. 1999. Structural constraints on RNA virus evolution. J. Virol.   73: 5787–5794. Google Scholar Simons, J. N., T. P. Leary, G. J. Dawson, T. J. Pilot-Matias, A. S. Muerhoff, G. G. Schlauder, S. M. Desai, and I. K. Mushahwar. 1995. Isolation of novel virus-like sequences associated with human hepatitis. Nat. Med.   1: 564–569. Google Scholar Smith, D. B., M. Basaras, S. Frost, D. Haydon, N. Cuceanu, L. Prescott, C. Kamenka, D. Millband, M. A. Sathar, and P. Simmonds. 2000. Phylogenetic analysis of GBV-C/hepatitis G virus. J. Gen. Virol.   81: 769–780. Google Scholar Suzuki, Y., K. Katayama, S. Fukushi, T. Kageyama, A. Oya, H. Okamura, Y. Tanaka, M. Mizokami, and T. Gojobori. 1999. Slow evolutionary rate of GB virus C/hepatitis G virus. J. Mol. Evol.   48: 383–389. Google Scholar Swofford, D. L. 1998. PAUP*. Phylogenetic analysis using parsimony (*and other methods). Version 4. Sinauer, Sunderland, Mass. Google Scholar Tanaka, Y., M. Mizokami, E. Orito et al. (15 co-authors). 1998. African origin of GB virus C/hepatitis G virus. FEBS Lett.   423: 143–148. Google Scholar Worobey, M., and E. C. Holmes. 1999. Evolutionary aspects of recombination in RNA viruses. J. Gen. Virol.   80: 2535–2543. Google Scholar Worobey, M., A. Rambaut, and E. C. Holmes. 1999. Widespread intra-serotype recombination in natural populations of dengue virus. Proc. Natl. Acad. Sci. USA 96:7352–7357. Google Scholar Zhou, Y. S., W. Chen, Q. M. Zhao, H. L. Zhao, J. S. Zhang, J. Xu, and H. T. Wang. 1996. cDNA cloning and sequencing of HGV genome from Chinese. Bull. Acad. Military Med. Sci.   20: 249–253. Google Scholar TI - Homologous Recombination in GB Virus C/Hepatitis G Virus JF - Molecular Biology and Evolution DO - 10.1093/oxfordjournals.molbev.a003799 DA - 2001-02-01 UR - https://www.deepdyve.com/lp/oxford-university-press/homologous-recombination-in-gb-virus-c-hepatitis-g-virus-6g0I1srHxW SP - 254 EP - 261 VL - 18 IS - 2 DP - DeepDyve ER -