Next: , Previous: Assembler scanner, Up: Extraction options


3.6.4 Text Scanner

The plain text scanner is intended for human-language documents, or as the scanner of last resort for files that have no scanner that is more specific. It is customizable to the extent that character classes can be designated as token constituents or as token delimiters. The default token constituents are the alpha-numerics; all other characters are considered token delimiters.

`-i character-class'
`--include=character-class'
Include characters belonging to character-class in tokens.
`-x character-class'
`--exclude=character-class'
Exclude characters belonging to character-class from tokens, i.e., treat them as token delimiters.