The variation and phylogenetic relationship of open reading frame 5 (ORF5) of 3 different laboratory strains of the original prototype Bucyrus strain of equine arteritis virus (EAV), the modified live virus vaccine (ARVAC, Fort Dodge Laboratories), and 18 field isolates of EAV from North America and Europe were determined by comparison of their gene sequences. The viruses differed from the published sequence by between 3 (99.6% homology) and 94 (87.8%) nucleotides and by between 3 (98.8%) and 24 (90.6%) amino acids. The field isolates differed from each other by between 2 (99.7%) and 110(85.7%) nucleotides and by between 1 (99.6%) and 26 (89.8%) amino acids. Comparison of the nucleotide sequences of these viruses indicates that although they are very closely related, the ORF5 of each Virus is distinct. The ORF5 of EAV encodes the G(L) envelope glycoprotein which expresses the neutralization determinants of the virus. Comparative analysis of the deduced amino acid sequence of the G(L) protein of the viruses identified three distinct variable regions (V-1 [aa 61-121], V-2 [141-178], and V-3 [aa 202-222]), a putative signal sequence (S [aa 1-18]), and four conserved regions (C-1 [aa 19-60], C-2 [aa 122-140], C-3 [aa 179-201], and C-4 [aa 223-255]). Amino acid substitutions in the V-1 region of the G(L) protein of EAV field isolates had significant effects on the predicted hydrophobicity and secondary structure of the protein, which Is potentially important because this region contains a major neutralization site. Estimation of genetic distances and phylogenetic tree analysis of these viruses identified four distinct groups of EAV isolates, including two North American (NA(1) and NA(2)) and two European (E(1) and E(2)) groups. The sequence data obtained from individual European and North American isolates suggest movement of viruses between the two continents. (C) 1995 Academic Press, Inc.