python-weka-wrapper 0.3.1 released

Mainly a release with added support for parameter optimization and some tools for making life easier when dealing with options.

Here is the detailed list of changes since the 0.3.0 release:

Posted on April 23, 2015 at 10:16 am by fracpete

python-weka-wrapper: 0.3.0 released

It’s been a while since the last release, and this one brings quite a number of bugfixes and additions (e.g., database access, text mining), so it is well worth the upgrade. A major addition is the workflow component, which wraps a lot of the functionality that the python-weka-wrapper library offers in a convenient way. Check out the examples in the examples repository, in the src/wekaexamples/flow sub-directory. The workflow was inspired by the one available in the ADAMS framework, though it is much simpler. It is a pure-Python implementation and has nothing to do with Weka’s KnowledgeFlow.

Changes:

Posted on April 15, 2015 at 10:54 pm by fracpete

New nlp-weka-package release

Just released a new version of my new Weka package for natural language processing (NLP):

https://github.com/fracpete/nlp-weka-package

Changes:

Here is a screenshot of the new Explorer tab:

[Image: parse_trees_screenshot]

Posted on March 30, 2015 at 2:26 pm by fracpete

serialized-model-viewer-weka-package released

Something that ADAMS’ Preview Browser has had for years, I’ve now added to Weka as a standalone tab in the Explorer: displaying the content of serialized model files. It allows the user to load a serialized model file (or actually any Java serialized file) and display its content, simply using the objects’ toString() method.

You can find the Weka package here:
https://github.com/fracpete/serialized-model-viewer-weka-package

Posted on March 26, 2015 at 1:28 pm by fracpete

nlp-weka-package released

Just released the first version of my new Weka package for natural language processing (NLP):

https://github.com/fracpete/nlp-weka-package

At the moment, it contains only some filters (ChangeCase, PartOfSpeechTagging) and tokenizers (WhiteSpaceTokenizer, PTBTokenizer). It uses the Stanford parser for the NLP heavy lifting.

Posted on March 25, 2015 at 11:39 am by fracpete

Google Code closes, moved projects to GitHub

Last week, Google announced that it is closing Google Code, its platform for hosting open source projects.
I wasn’t particularly thrilled by that, since I have a number of projects hosted there. But it was a free service, so you can’t expect them to keep throwing money at it, especially since more and more projects seem to be moving to GitHub. Hence I spent the whole day today moving my remaining projects to GitHub as well:

For some of these Maven projects, I made new releases available, using the new groupId of com.github.fracpete. Check out the projects for more details.

Posted on March 16, 2015 at 5:06 pm by fracpete

ssh into VirtualBox Linux guest

Just came across the “How to SSH to a VirtualBox guest externally through a host?” post on Stack Overflow, which explains how to use port forwarding in order to ssh into a Linux machine running inside VirtualBox. Unless you know that there is a port forwarding button hidden away beneath the Advanced network settings, you’d never find it…
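For those who prefer the command line over the hidden button, the same NAT port forwarding rule can be set with VBoxManage. Here is a minimal Python sketch that only assembles the command. The VM name, the rule name and host port 2222 are made-up values for illustration, and it assumes the guest uses the default NAT adapter and runs sshd on port 22.

```python
def natpf_rule(name, host_port, guest_port=22, protocol="tcp",
               host_ip="", guest_ip=""):
    """Build a VirtualBox NAT port forwarding rule string.

    Rule format: name,protocol,host ip,host port,guest ip,guest port
    (empty IPs mean "any host interface" / "whatever IP the guest has").
    """
    return f"{name},{protocol},{host_ip},{host_port},{guest_ip},{guest_port}"


def forward_command(vm_name, rule, adapter=1):
    """Return the VBoxManage argument list that adds the rule to a powered-off VM."""
    return ["VBoxManage", "modifyvm", vm_name, f"--natpf{adapter}", rule]


if __name__ == "__main__":
    # "my-linux-guest" and "guestssh" are hypothetical names
    rule = natpf_rule("guestssh", 2222)
    cmd = forward_command("my-linux-guest", rule)
    print(" ".join(cmd))
    # To actually apply it: import subprocess; subprocess.run(cmd, check=True)
```

With the rule in place and the guest booted, `ssh -p 2222 user@127.0.0.1` on the host should end up at the guest’s sshd. Note that `modifyvm` only works while the VM is powered off; for a running VM, `VBoxManage controlvm <vm> natpf1 <rule>` takes the same rule string.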

Posted on March 16, 2015 at 9:33 am by fracpete

ADAMS 0.4.8 released

Hot off the press comes ADAMS 0.4.8. It is mainly a bugfix release, as there were a number of things broken in the last release:

Also noticed today that the download link on mloss.org had been broken for the last release. Duh!

Posted on March 4, 2015 at 1:01 pm by fracpete

Minor bug-fix release for Collective-classification

Petr came across a bug that affected the output of the predictions generated from the test set. It worked fine for cross-validation, but not for the Random split and Unlabeled/Test set modes. I’ve committed a fix and made a new release available:
https://github.com/fracpete/collective-classification-weka-package/releases/tag/v2015.2.27

Posted on February 27, 2015 at 1:10 pm by fracpete

IntelliJ IDEA vs Eclipse

I can’t remember for how long I have been an Eclipse user, but at least since the Callisto release some time in 2006. Back then I was deciding between NetBeans and Eclipse, with Eclipse winning, because it used less memory. Despite having used JBuilder in the past, I didn’t want to go back to using this platform.
Anyhow, in terms of IDE I felt a bit like a Windows user: being stuck with a piece of software that kind of does the trick but has these weird and unexplainable behaviors sometimes like:

And guess what, just like with Windows, rebooting usually fixes some of these problems. Worst bit, you simply accept it, because you are stuck with it.
Years ago, I tried IntelliJ IDEA, but it failed to handle my multi-module Maven projects. Nothing left but to keep using Eclipse.

But then, two weeks ago, I made the switch to IntelliJ IDEA after all, after one of my students was able to work with the ADAMS code base without any problems. It took me a while to get things sorted out and configured, but after using it for two weeks straight, it’s nice to have not only a great editor that supports you in writing code very efficiently, but also a solid and fast IDE! There are a few quirks, but none of them are dealbreakers!

So, to sum it up: I’m a happy IntelliJ IDEA user now! :-)

Posted on January 29, 2015 at 10:35 am by fracpete