A personal email to me made me realize I have even better data
for E. coli. Kenn Rudd estimates from his E. coli sequence database
that the genome size is 4673600, while the number of genes is 3237.
This gives:
Rfrequency = log2(4673600/3237) = 10.5 bits per site.
This will be published in a forthcoming book:
@inproceedings{Rudd.Schneider1992,
author = "K. E. Rudd
and T. D. Schneider",
title = "Compilation of {{\em E. coli}} Ribosome Binding Sites",
pages = "?-?",
editor = "Jeffrey Miller",
booktitle = "A short course in bacterial genetics: A laboratory
manual and handbook for {{\em Escherichia coli}} and related bacteria",
publisher = "Cold Spring Harbor Laboratory Press",
address = "Cold Spring Harbor, New York",
year = "1992"}
Tom Schneider
National Cancer Institute
Laboratory of Mathematical Biology
Frederick, Maryland 21702-1201
toms at ncifcrf.gov