6 lines
318 B
Plaintext
6 lines
318 B
Plaintext
|
This class knows how to read two treebank formats, the Penn format and
|
||
|
the Chomsky Normal Form (CNF) format. These formats differ in how they
|
||
|
handle terminal nodes. The Penn format places pre-terminal part of
|
||
|
speech tags in the left-hand position of a parenthesis-delimited pair,
|
||
|
just like it does non-terminal nodes.
|