Yoshikoder

What’s new with the Yoshikoder?

Archive for May 2006

YKConverter for Windows

without comments

Just a quick note to say that the YKConverter is now available as an executable for Windows.

The application default is to save all converted documents to a particular directory, shown in the preferences. If you’d prefer each file to be saved as text next to the original, switch that in the preferences too.

Sometimes the converter will fail to convert PDF documents. Most times this is because they have been locked by the author. Locking a PDF document means that no text can be extracted, either manually or automatically. Unfortunately, there’s not much I can do about that.

Written by Will

May 4, 2006 at 10:06 pm

Posted in Development

In the pipeline

with one comment

Just got back from visiting to the friendly folk at Penn State’s Political Science department. Yoshikoder figured in my day course on text analysis.

Anyway, as a result of conversations in the lab, I’ve added some more stuff. When it’s finished a round of testing, it will be part of release 0.6.3. In the meantime here’s the list:

  • Unified frequency report – Word frequencies for all the selected documents in the same table
  • Multiple document addition – Select a large number of documents and import them all at once
  • Statistical document comparison – How much more of each category is there in document A than in document B, with confidence intervals

There is one more goody in the collection, but that will get a post all of it own.

If there’s something you’d particularly like to see, let me know.

Written by Will

May 2, 2006 at 7:26 pm

Posted in Development