HERV-H elements are a large family of endogenous retrovirus-like sequences found in approximately 1000 dispersed copies in the genomes of humans and other primates. The most abundant subclass of these elements is a partially deleted form of 5.8 kb which is transcribed primarily as a 5.6-kb unit length RNA and a 3.7-kb spliced derivative. The provirus-like structure of these elements suggests that their numbers have increased in the genome through retrotransposition. However, this has not been demonstrated for HERV-H. To determine if genomic expansion of HERV-H elements involved an RNA intermediate, primate DNAs were screened by PCR for elements that were transcribed, spliced, reverse transcribed, and integrated back into the genome. This PCR screen detected several genomic HERV-H fragments that appear to be derived from spliced transcripts. Interestingly, the presence of one of these fragments is polymorphic in humans, suggesting that its integration was a relatively recent event. Another PCR strategy was used to determine that at least one of the spliced elements has an intact 5' LTR, indicating that it is not simply a ''processed pseudogene'' or cDNA copy of a HERV-H transcript. Genomic cloning and sequencing of a human locus harboring a spliced element revealed the expected structure, e.g., intact LTRs and flanking 5-bp direct repeats, for a virally retrotransposed element. A genomic library screening method also indicated that very few HERV-H elements (less than 1%) have the structure of processed pseudogenes. These results suggest that most HERV-H elements amplified in the genome as viral retrotransposons. (C) 1995 Academic Press, Inc.