This tutorial will show you how to use a range of standard Perl features and numerous CPAN modules to read in, decipher, process, and reformat ASCII text data.
You will learn:
- how to balance nested brackets and match delimiters without a complex regular expression;
- how to recognize and process common text formats like CSV, and various tagged mark-up notations (such as HTML);
- how to preprocess archived text formats like (g)zip, tar, uuencoding, and MIME using only Perl;
- how to preprocess these same archived text formats by calling system level utilities from Perl;
- how to take advantage of Perl's I/O and regex mechanisms to decode simple data formats automatically during input;
- how to decipher and convert binary formats;
- how to convert your processed data to and from readable text, in either fixed or floating formats;
- how to handle ambiguity and errors when processing text;
- how to extract, process, and generate simple natural language data.
1 day seminar
Novice Perl programmers who are familiar with simple I/O and variables, and who want to a deeper insight into the techniques of Perl's "core business": extraction, manipulation, and reporting of data.