ScreenCasts | Videos from screen

Text Processing (removing unwanted paragraph breaks)

If we source a public domain text on the web, such as those from the Gutenberg project, we may well find that each line of thext has a fixed length. This is because the text was probably scanned in and then converted to real text through OCR software. The text is laid out exactly as it was in the original source. This facsimile is not appropriate for our needs.

If you cannot see the screencast here then - Download the screencast

Size of file is: 11 MB

Duration: 00:06:55

415.02 seconds

Filed Under: Making eBooks