Stopwords for Weka

Stopwords support in Weka has always been a bit poor, to say the least. Initially, there was only a hard-coded list, based on the Rainbow tool. However, having stopwords only for the English language was a bit limiting. Being able to supply your own list of stopwords in the StringToWordVector filter already made the whole thing a bit more flexible. But you still couldn’t supply your own stopwords algorithm. Yesterday, I sat down and implemented a new class hierarchy centered around the weka.core.stopwords.StopwordsHandler interface. I added the following algorithms:

Eibe reworked the StringToWordVector filter today to make use of the new class hierarchy.
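
For illustration, here is a minimal sketch of what a custom stopwords algorithm behind such an interface could look like. It assumes the interface boils down to a single isStopword(String) method. To stay self-contained, the sketch declares a local stand-in interface instead of importing the Weka one; WordSetHandler and filter are hypothetical names and not part of Weka.

```java
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

public class StopwordsSketch {

    /** Local stand-in for weka.core.stopwords.StopwordsHandler (assumed: one method). */
    interface StopwordsHandler {
        boolean isStopword(String word);
    }

    /** Hypothetical handler backed by a user-supplied set of words (case-insensitive). */
    static class WordSetHandler implements StopwordsHandler {
        private final Set<String> words = new HashSet<>();

        WordSetHandler(String... stopwords) {
            for (String w : stopwords) {
                words.add(w.toLowerCase(Locale.ROOT));
            }
        }

        @Override
        public boolean isStopword(String word) {
            return words.contains(word.toLowerCase(Locale.ROOT));
        }
    }

    /** Drops every whitespace-separated token that the handler flags as a stopword. */
    static String filter(String text, StopwordsHandler handler) {
        StringBuilder sb = new StringBuilder();
        for (String token : text.trim().split("\\s+")) {
            if (token.isEmpty() || handler.isStopword(token)) {
                continue;
            }
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(token);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // A tiny German stopword list, purely as an example of a non-English language.
        StopwordsHandler german = new WordSetHandler("der", "die", "das", "und");
        System.out.println(filter("Der Hund und die Katze", german)); // prints "Hund Katze"
    }
}
```

The point of the design is that the consumer (here the filter method, in Weka the StringToWordVector filter) only asks "is this a stopword?", so a fixed list, a file-based list, or a rule-based algorithm can be swapped in without touching it.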

Posted on July 21, 2014 at 12:46 pm by fracpete

python-weka-wrapper 0.1.8 released

Today, I released a new version of the python-weka-wrapper library: 0.1.8.
This is mostly a bugfix release: the scatterplot for datasets and the installer were broken. The only new functionality is that Instance objects can now be created from Python lists as well, rather than just NumPy arrays.

Posted on June 26, 2014 at 11:00 pm by fracpete

JFileChooser-Bookmarks project launched

Last year, while working on a consulting project, I had to export lots of screenshots from ADAMS. I got so annoyed at constantly having to click through my directory hierarchy that I implemented a little accessory component for the JFileChooser in Swing, allowing me to define bookmarks. Man, that made it so much easier all of a sudden!

The component is modeled after the bookmarks from the file chooser that Gnome users have been familiar with for many years. Here is a screenshot:

[Screenshot: jfilechooserbookmarks]

You can find the project homepage here:
https://code.google.com/p/jfilechooser-bookmarks/

And on Maven Central:
http://search.maven.org/#search|ga|1|g%3A%22com.googlecode.jfilechooser-bookmarks%22

Posted on June 26, 2014 at 9:09 am by fracpete

ADAMS 0.4.6 released

This release is mainly a bugfix release, due to the broken Twitter replay functionality in the 0.4.5 release. But I also added a new add-on module for MEKA (multi-label extension to WEKA), called adams-meka, to make it a worthwhile upgrade. :-)

As usual, you can download the new release from the ADAMS homepage:
https://adams.cms.waikato.ac.nz/release

Posted on June 23, 2014 at 4:25 pm by fracpete

Relaunch of ADAMS website

A few late nights and a high dose of Bootstrap later, the new website for ADAMS is now finally available:
https://adams.cms.waikato.ac.nz/

Posted on June 20, 2014 at 9:50 am by fracpete

python-weka-wrapper 0.1.6 released

After banging my head against the wall a few times today, trying to install python-weka-wrapper on Mac OS X, I finally got things working. Hence a new, minor bugfix release, with the following changes:

It is available from PyPI, as usual.

Posted on May 29, 2014 at 4:12 pm by fracpete

Google Group for python-weka-wrapper library

To make it easier for people to interact, ask questions, and post patches, I created a Google Group for the python-weka-wrapper library:

https://groups.google.com/forum/#!forum/python-weka-wrapper

Feel free to join! :-)

Posted on May 27, 2014 at 10:30 pm by fracpete

MEKA on Maven Central

Yesterday, I worked hard to get the MEKA releases 1.5.0 through to 1.6.2 deployed on Maven Central. The next minor release will be the last one where manual deployment will be necessary. After that, I’ll be migrating the project to Maven.

Here you can find the MEKA artifacts:
http://search.maven.org/#search|ga|1|meka

I also started a new ADAMS module for MEKA: adams-meka. So far the progress has been really good. Probably won’t be too long before it leaves the incubating stage.

Posted on May 27, 2014 at 3:38 pm by fracpete

PTStemmer Weka package

Also created another Weka package for the PTStemmer developed by Pedro Oliveira:
https://github.com/fracpete/ptstemmer-weka-package

You can download package archives ready to install from the release section:
https://github.com/fracpete/ptstemmer-weka-package/releases

Posted on May 25, 2014 at 9:31 am by fracpete

Snowball stemmers

Just created a new Weka package for the Snowball stemmers:
https://github.com/fracpete/snowball-stemmers-weka-package

You can download Weka packages from the release section of that GitHub repository:
https://github.com/fracpete/snowball-stemmers-weka-package/releases

Posted on May 25, 2014 at 9:27 am by fracpete