Text Processing (removing unwanted paragraph breaks)

If we source a public domain text on the web, such as those from Project Gutenberg, we may well find that every line of the text has been broken at a fixed length. This is probably because the text was scanned in and then converted to real text by OCR software, so it is laid out exactly as it was in the original source. This facsimile layout is not appropriate for our needs: we want each paragraph to flow as a single unit rather than break at the original line ends.
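
As a minimal sketch of one way to undo this, the Python function below joins the hard-wrapped lines inside each paragraph and keeps the paragraph breaks. It assumes that real paragraphs are separated by at least one blank line, which is common in such files but not guaranteed. The function name `unwrap_paragraphs` and the sample text are illustrative only.

```python
import re


def unwrap_paragraphs(text: str) -> str:
    """Join hard-wrapped lines, treating blank lines as paragraph breaks."""
    # Normalise Windows and old Mac line endings to "\n".
    text = text.replace("\r\n", "\n").replace("\r", "\n")

    # Assumption: one or more blank lines (possibly containing spaces)
    # separate genuine paragraphs.
    paragraphs = re.split(r"\n\s*\n", text.strip())

    # Within a paragraph, collapse every run of whitespace, including
    # the unwanted line breaks, into a single space.
    unwrapped = [" ".join(p.split()) for p in paragraphs]

    # Rejoin the paragraphs with exactly one blank line between them.
    return "\n\n".join(unwrapped)


sample = (
    "It was a bright cold day in spring,\n"
    "and the clocks were striking\n"
    "thirteen.\n"
    "\n"
    "The hallway smelt of boiled\n"
    "cabbage and old rag mats.\n"
)

print(unwrap_paragraphs(sample))
```

Each paragraph comes out as one long line, leaving any wrapping to the program that eventually displays the text. Note that a word hyphenated across a line break would still need separate handling, since this sketch simply puts a space where the line break was.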

