implement new XML API, using XML::Simple

Unfortunately, we cannot yet parse the additional hints (plural forms,
cases, parts of speech, etc., everything inside <repr> tags) because the
XML::Simple API does not retain the correct ordering of CDATA content
mixed with subtags... :-/

This also removes the options -SpellTolerance, -Morphology, and
-CharTolerance, which are no longer supported by the new API (as far as
I can see).

Note that although the XML contains UTF-8 data, XML::Simple decodes
it to latin1, so we have to re-encode it to get good results.
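
The re-encoding step can be sketched with the core Encode module alone. This is a minimal illustration, not the code from this commit. It assumes the parser hands back decoded character strings, which Perl would write out as Latin-1 unless they are encoded again. The sample string and all variable names are hypothetical.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Hypothetical sample: "Grüße" as raw UTF-8 bytes, as it would appear in the XML.
my $bytes_in = "Gr\xC3\xBC\xC3\x9Fe";

# Assumption: the parser returns a decoded character string like this one.
my $chars = decode('UTF-8', $bytes_in);

# Re-encode to UTF-8 bytes before writing to a byte-oriented handle.
my $bytes_out = encode('UTF-8', $chars);

printf "%d characters, %d bytes\n", length($chars), length($bytes_out);
```

Setting an encoding layer on the output handle with binmode(STDOUT, ':encoding(UTF-8)') would be an alternative to encoding each string by hand.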
Roland Hieber
2017-03-11 06:10:24 +01:00
parent de4031dd70
commit 89ac6cbcd6
2 changed files with 59 additions and 112 deletions

@@ -12,7 +12,7 @@ WriteMakefile(
 'PREREQ_PM' => { 'Carp::Heavy' => 0,
                  'IO::Socket' => 0,
                  'MIME::Base64' => 0,
-                 'HTML::TableParser' => 0
-                }
+                 'XML::Simple' => 0
+                }
 );