Gene regulation and the lac operon
Gene regulation is the process by which cells, whose DNA content is constant (for the most part) over their lifetime, change their behavior. The gene content of a particular cell determines the functions that the cell can perform, the the functions that it actively perform is determined by which subset of these genes is expressed. Therefore, gene expression must be toggled according to the environment, so that a cell can function in variable environments. In this post will discuss (and write down a toy model for) one of the most famous examples of a gene regulation circuit: the lac operon, which governs the metabolism of lactose in E. coli. This circuit was understood in 1961 by Jacob and Monod. Before jumping into the mechanism, we will define some terms.
Transcription
Transcription is the process by which a DNA sequence becomes a piece of messenger RNA, which in turn is converted into a protein. It is worth noting that most RNA that is transcribed never becomes a protein: a large fraction of RNA has a structural function in the ribosome, which is the machine that makes proteins from messenger RNA. For example, in S. cerevisiae, 60% of the cell’s total transcription is devoted to transcription of ribosomal RNA. However for the purposes of the lac operon, we will be talking about transcription of so-called “coding genes” which do get transcribed. Transcription is carried out by the enzyme RNA polymerase, which travels down a strand of DNA, producing a single strand of RNA corresponding to one of the two DNA strands (which are complementary to one another). This mRNA later gets translated into a protein, which carried out some function for the cell.
Transcription factors and promoters
A transcription factor is a protein responsible for regulating the expression of genes, via binding to specific DNA sequences, which fall into two categories: enhancers and promoters. Promoters are sequences of DNA situated upstream of genes, and the transcription factors that bind to them are called general transcription factors. The binding of these transcription factors increases the recruitment of RNA polymerase and therefore the rate of transcription of the downstream gene. Enhancers are regions of the genome that have a similar effect, but can be located either upstream or downstream of the gene whose expression they regulate — they carry out regulation by virtue of their physical proximity to the gene of interest: since DNA is a long and curved molecule, regions need not be close along the length of the strand in order to be close in physical space. Transcription factors which bind to enhancers also increase transcription rates of genes by recruiting RNA polymerase.
Trascription factors (TFs) constitute a sizeable fraction of all genes; in humans there are
The lac operon
E. coli have the ability to consume multiple different types of sugar, but that doesn’t mean that they are agnostic to what they eat. If glucose is present, even if other sugars such as lactose are present, the bacteria will only consume the glucose. Once the glucose runs out, they will shift their metabolism and start consuming lactose. This shift is called a diauxic shift, and occurs as a result of changing gene expression in the cell in response to the environment. E. coli synthesizes an enzyme that breaks down lactose, but only in the joint presence of lactose and absence of glucose. How does it control this process?
The lac operon is a collection of three genes under the control of a single promoter. These genes code for proteins that enable the metabolism of lactose: one enzyme (lacZ) cleaves lactose into glucose and galactose, the other (lacY) transports lactose into the cytoplasm through the membrane, and third (lacA) codes for a protein whose function, amazingly, remains unknown. These three genes, put together, amount to around 5000 bases (5kb) of DNA. The E. coli cell needs to transcribe these genes at a higher rate when there is lactose but no glucose in the environment.
In order to accomplish this behavior, a system has evolved where the default state is repression. That is, in absence of any external stimulus, there is a repressor protein which binds near to the lac operon promoter (at a region called the operator) and prevents transcription of the associated genes. When there is lactose in the environment, it is metabolized at a low rate, and a product of this metabolism, allolactose, binds to the repressor, changing its shape and making it unable to block transcription. RNA polymerase can now transcribe the operon, increasing the lactose metabolism rate. Note that this order of events displays positive feedback: a little metabolism of lactose produces more allolactose which produces more lactose metabolism, etc.
However, in addition to being transcribed in the presence of lactose, the lac operon is repressed in the presence of glucose. This repression is controlled by a signaling molecule called cAMP, which varies inversely with glucose concentration in the cell—it essentially indicates hunger for glucose. When cAMP concentrations are high, the cAMP molecules form a complex with a molecule called CAP, and bind upstream of the promoter, encouraging transcription. Conversely when glucose is high there is little cAMP in the cell, and so transcription of the lac operon is reduced, because the cAMP-CAP complex does not get created and upregulate transcription.

A mathematical model
How can we make a simple mathematical model of this process? To begin with, we can try to understand the following observation: there is a sudden shift in levels of lac operon transcription as a function of lactose levels, as opposed to a continuous change. Once we understand this switch-like behavior, we can also try to understand when bistability occurs: namely, if the cell is mostly consuming glucose, and enough lactose is introduced, the cell undergoes a diauxic shift and remains in mostly-lactose-metabolism mode even when the original conditions are restored.
Let us denote the number of lacZ transcripts in the cell as
where
Enzyme kinetics
Imagine that we have a certain concentration
In equilibrium, what is the concentration of the
Therefore in steady state, this time derivative vanishes and we have
This equation, also known as the Michaelis Menten equation, will be our friend in trying to understand reaction rates when one of the reagents is limiting. Note the behavior of
The exponent
Back to the lac operon
Returning back to our model for the regulation of the lac operon, we can first posit a form for
Now we want to guess a form for
We can therefore approximate the concentration of unbound promoters as
We now have a set of differential equations that describe the levels of lacZ and allolactose in the cell. Parameterizing unknown rates by
Analysis
We can analyze the behavior of our differential equation model by looking at the nullclines of our system: the lines in the

Now, how does glucose come into the picture? Recall that glucose decreases the prevalence of the cAMP-CAP complex, which itself encourages transcription. To capture this effect we can add a term proportional to
Bistability

Using what we have learned, we can now ask what happens if multiple allolactose molecules are needed to bind to a repressor to stop it from repressing transcription. Then, as we discussed above, the shape of
In terms of the lac operon, this would correspond to a situation with hysteresis in the levels of lacZ as a function of lactose levels. This is certainly plausible biologically, as it may be favorable to consume lactose until the levels are depleted beyond the level at which lactose metabolism turns on, particularly if switching metabolisms is costly.
Summary
We have seen how positive feedback can give rise to switch-like behavior and bistability, in the lac operon.
It is possible to obtain similar types of behavior with negative feedback, which is also relevant in biological systems such as the lysogenic life cycle of
References and further reading
Daniel Fisher’s cellular biophysics class
Mitchell Lewis, A Tale of two Repressors