E. coli RfaH
Paula Cancelas Calvo '22 and Christopher Ponne '22
Contents:
I. Introduction
The Escherichia coli RfaH protein is a universally
conserved transcription factor of the NusG/Spt5 family and has only
been studied in Escherichia coli. This family of proteins are the only
transcription factors that have coevolved with RNAP since the last
universal common ancestor. They also have a unique method to recruit
and elongate RNA polymerase (RNAP) for the transcription of virulence
genes. RfaH, in particular, is found to activate long operons that
encode antibiotics, capsules, toxins, and pili through inhibition of
Rho-dependent termination (Zuber, 2018).
When RNAP encounters transcriptional pausing or termination,
RfaH will work as an antiterminator to assist in elongation. RfaH
binds directly to the ops sequence located in the non-template (NT)
DNA strand of the transcription bubble. Contact with the ops
sequence is thought to trigger domain dissociation, transforming
RfaH into an open, active state where the N-terminal domain
can bind to RNAP and the C-terminal domain
changes structure. Once activated, RfaH remains bound to the
transcription elongation complex (TEC) until termination. The CTD
recruits the 30S subunit of the ribosome to the template/leader
sequence where the ribosome will then translate the RNA after
transcription. After TEC dissociation, RfaH regains the
autoinhibited state and the cycle repeats (Zuber, 2018). By
understanding this protein it may be possible to inhibit RfaH and
enable termination of viral genes found in any cell, including human
cells.
Figure 1.
RfaH recruitment to the ops-paused RNAP via the hairpin loop in the NT strand of the ops sequence. RfaH activates and upon release the CTD changes into a beta barrel. The CTD interacts with the small subunit of the ribosome to begin translation of the RNA afterwards, RfaH is finally released along with RNAP. The process repeats with another ops sequence (Zuber, 2018).
II. General Structure
Proteins of the NusG/Spt5 family usually have
N-terminal domains (NTD) of mixed alpha and beta
topology connected to beta barrel C-terminal domain
(CTD) along with a KOW motif via a flexible linker called the N/C linker.
Rfah, however, the CTD consists of two
in an alpha helical hairpin that have 2 acidic residues
at the tip of the domain (
and
). The
NTD binds across the DNA-binding channel, bridging the
RNAP beta prime clamp and beta lobe domains pincers and
resulting in RNAP becoming pause-resistant. On release,
the alpha helical CTD spontaneous forms into a beta
barrel (Zuber, 2018)
III. RfaH and ops interaction
The main difference between NusG and RfaH is that
RfaH action depends on the ops site. The ops site is
thought to play four roles during RfaH recruitment,
first it can slow down RNAP to allow more time for RfaH
recruitment. Rfah targets all have a pause-inducing TG
dinucleotide at position 11 and 12 (not shown) which
allows RNAP delay.Second, it mediates sequence-specific
binding of RfaH to the NT DNA strand exposed on the
surface of the TEC. Third, induces TEC isomerization
into a structurally distinct paused state that may be
necessary for productive recruitment of RfaH. Finally,
pausing could be required for ribosome recruitment.
T11, as previously mentioned, has
an essential role in pausing RfaH activity. Without
RfaH, RNAP pauses at C9 and U11 in the ops template
strand. With RfaH, pausing at U11 is reduced but
increases at G12. However, pausing at U11 is
dispensable for RfaH binding when RNAP transcribes
slowly (Belogurov, 2007 and Zuber, 2018). It is
important to remember that the ops sequence/site is
located in the NT hairpin.
IV. Non-template Hairpin loop interaction
with RfaH-NTD
The NT hairpin is required for RfaH
recruitment. RfaH-NTD binds to the ops sequence in
the NT DNA hairpin loop which forms when the DNA
binds to the basic patch of RfaH-NTD, opposite of
the RNAP/RfaH-CTD binding site. The
loop
consists of G4-A7 , with
flipped
out leaving the nucleobase completely exposed. The
other nucleotides of the loop make stacking
interactions.
The flipped T6 is inserted
into a deep narrow,
on Rfah-NTD, which is formed of H20, R23, Q24,
and R73 located in helices alpha 1 and alpha 2.
is
packed in the positive surface next to the
cavity T6 is located. RfaH-NTD makes contact
with nucleotides in the loop, involving K10,
H20, R23, Q24, T68, N70, A71, T72, R73, G74,and V74
.
Base specific interactions with RfaH-NTD are
made G4, G5, and T6, but only G5 and T6 form a
hydrogen bond network
and may underline sequence specific recognition.
The side chains K10,
H20, R23, and R73 directly
interact with the ops DNA. There are no aromatic
residues located near G5 and T6, therefore
contact between these two nucleotides and these
4 amino acids actually mediate specific
recognition of ops by RfaH. The hairpin stem is
formed by base pairs C3:G8 and G2:C10, with T9
being flipped out (Zuber, 2018).
Figure
2.
VI. References
Belogurov, G.
A., Vassylyeva, M. N., Svetlov,
V., Klyuyev, S., Grishin, N.V.,
Vassylyev, D. G., and
Artsimovitch, I. “Structural basis
for converting a general
transcription factor into an
operon-specific virulence
regulator.” Molecular cell
vol. 26,1 (2007): 117-29.
Kang, J.
Y., Mooney, R. A., Nedialkov,
Y., Saba, J., Mishanina, T. V.,
Artsimovitch, I., Landick, R.,
and Darst, S. A. “Structural
Basis for Transcript Elongation
Control by NusG Family Universal
Regulators.” Cell vol.
173,7 (2018): 1650-1662.e14.
Zuber, P.
K., Artsimovitch, I.,
NandyMazumdar, M., Liu, Z.,
Nedialkov, Y., Schweimer, K.,
Rösch, P., and Knauer, S. H.
“The universally-conserved
transcription factor RfaH is
recruited to a hairpin structure
of the non-template DNA strand.”
eLife vol. 7 e36349. 9 May.
2018.
Back to Top