Conference paper
Being literate with large document collections: Observational studies and cost structure tradeoffs