TY - JOUR
T1 - Sequence, structure and function-based classification of the broadly conserved FAH superfamily reveals two distinct fumarylpyruvate hydrolase subfamilies
AU - Hong, Hwaseok
AU - Seo, Hogyun
AU - Park, Woojin
AU - Kim, Kyung Jin
N1 - Publisher Copyright:
© 2019 Society for Applied Microbiology and John Wiley & Sons Ltd.
PY - 2020/1/1
Y1 - 2020/1/1
N2 - Fumarylacetoacetate hydrolase (FAH) superfamily proteins are found ubiquitously in microbial pathways involved in the catabolism of aromatic substances. Although extensive bioinformatic data on these proteins have been acquired, confusion caused by problems with the annotation of these proteins hinders research into determining their physiological functions. Here we classify 606 FAH superfamily proteins using a maximum likelihood (ML) phylogenetic tree, comparative gene-neighbourhood patterns and in vitro enzyme assays. The FAH superfamily proteins used for the analyses are divided into five distinct subfamilies, and two of them, FPH-A and FPH-B, contain the majority of the proteins of undefined function. These subfamilies include clusters designated FPH-I and FPH-II, respectively, which include two distinct types of fumarylpyruvate hydrolase (FPH), an enzyme involved in the final step of the gentisate pathway. We determined the crystal structures of these FPH enzymes at 2.0 Å resolutions and investigate the substrate binding mode by which these types of enzymes can accommodate fumarylpyruvate as a substrate. Consequentially, we identify the molecular signatures of the two types of FPH enzymes among the broadly conserved FAH superfamily proteins. Our studies allowed us to predict the relationship of unknown FAH superfamily proteins using their sequence information.
AB - Fumarylacetoacetate hydrolase (FAH) superfamily proteins are found ubiquitously in microbial pathways involved in the catabolism of aromatic substances. Although extensive bioinformatic data on these proteins have been acquired, confusion caused by problems with the annotation of these proteins hinders research into determining their physiological functions. Here we classify 606 FAH superfamily proteins using a maximum likelihood (ML) phylogenetic tree, comparative gene-neighbourhood patterns and in vitro enzyme assays. The FAH superfamily proteins used for the analyses are divided into five distinct subfamilies, and two of them, FPH-A and FPH-B, contain the majority of the proteins of undefined function. These subfamilies include clusters designated FPH-I and FPH-II, respectively, which include two distinct types of fumarylpyruvate hydrolase (FPH), an enzyme involved in the final step of the gentisate pathway. We determined the crystal structures of these FPH enzymes at 2.0 Å resolutions and investigate the substrate binding mode by which these types of enzymes can accommodate fumarylpyruvate as a substrate. Consequentially, we identify the molecular signatures of the two types of FPH enzymes among the broadly conserved FAH superfamily proteins. Our studies allowed us to predict the relationship of unknown FAH superfamily proteins using their sequence information.
UR - http://www.scopus.com/inward/record.url?scp=85074871305&partnerID=8YFLogxK
U2 - 10.1111/1462-2920.14844
DO - 10.1111/1462-2920.14844
M3 - Article
C2 - 31657110
AN - SCOPUS:85074871305
SN - 1462-2912
VL - 22
SP - 270
EP - 285
JO - Environmental Microbiology
JF - Environmental Microbiology
IS - 1
ER -