Archive for May 2006
YKConverter for Windows
Just a quick note to say that the YKConverter is now available as an executable for Windows.
The application default is to save all converted documents to a particular directory, shown in the preferences. If you’d prefer each file to be saved as text next to the original, switch that in the preferences too.
Sometimes the converter will fail to convert PDF documents. Most times this is because they have been locked by the author. Locking a PDF document means that no text can be extracted, either manually or automatically. Unfortunately, there’s not much I can do about that.
In the pipeline
Just got back from visiting to the friendly folk at Penn State’s Political Science department. Yoshikoder figured in my day course on text analysis.
Anyway, as a result of conversations in the lab, I’ve added some more stuff. When it’s finished a round of testing, it will be part of release 0.6.3. In the meantime here’s the list:
- Unified frequency report – Word frequencies for all the selected documents in the same table
- Multiple document addition – Select a large number of documents and import them all at once
- Statistical document comparison – How much more of each category is there in document A than in document B, with confidence intervals
There is one more goody in the collection, but that will get a post all of it own.
If there’s something you’d particularly like to see, let me know.