Simulating an $L=10^6$ genome

Next we simulate many different motifs of the same length, and with many different choices of length

For sequences where all four nucleotides appear independently with probability $1/4$, $$ \mathrm{Critical\;motif\;length}\approx \frac{1}{2}\log_2L $$