The Clostridium thermocellum celE gene was expressed at high level in Escherichia coli TG1 when placed under the transcriptional and translational control of lacZ′ in pUC18; in the presence of a multicopy plasmid (pNM52) containing the lacIq gene, expression of full-length and truncated forms of celE was regulated by isopropyl-β-d-thiogalactopyranoside. Catalytically active endoglucanase E (EGE) produced by E. coli was subject to proteolytic processing. The main protein species produced from full-length and truncated forms of celE was around 40 kDa in size and had an N-terminal amino acid sequence corresponding to that derived for mature EGE from the nucleotide sequence; in addition, larger species of about 75 kDa, presumably corresponding to full-size EGE, were produced by E.coli containing the full-length celE gene. Even after removal of the signal peptide sequence, EGE produced by E. coli was secreted into the periplasm. Up to 157 bp could be deleted from the 5′ end of the celE gene without affecting the catalytic activity of EGE produced by E. coli. A polypeptide of M, 86 kDa, immunoreactive with anti-EGE antiserum, was demonstrated in the high-molecular-weight, cellulose-binding multiprotein aggregate recoverable from C. thermocellum culture supernatant. © 1990.