Feb. 15th, 2004

maradydd: (Default)
My reannotator, what there is of it so far, works. At the moment, that's just the bit that reads the reannotation information into an STL map (and a very complicated map it is; typedefs are my New Best Friend), but hey, it's crucial, and it's functional. I also grabbed myself a stemmer -- in case anyone else needs one, it is worth noting that PC-KIMMO is a morphological-analysis C library in its own right, but the site's weirdly silent on licensing information -- as I anticipate I'll need one when I start learning subcat frames off the Penn Treebank. (That's not the definition of "learning" where the knowledge learned goes into my brain, by the way.)

Tiny bites at a time seem to be the way to go on this thing. Next step: write the bit that reads in Brown-tagged text, splits it into word-tag pairs, and does the first reannotation pass off the map. Then: write the bit that disambiguates tags like TO and the being/having verbs. Then: work out how to automatically build a frequency table for subcategorisation frames that appear in Penn.

And then the real fun begins.

But now I have to go decorate my stairs.

On a wholly unrelated note, I should have found those Harman/Kardon speakers I wasn't using and ziptied them to my canopy bed long ago.

Profile

maradydd: (Default)
maradydd

September 2010

S M T W T F S
   1234
567891011
12131415 161718
19202122232425
26 27282930  

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags