Self Training

  • Research questions:
    • can we improve performance by training on self labelled data?
    • how to train the supertagger using the parser

In Progress

  • Implementing a map-reduce based combining of the features

Progress Updates

  • parallelised the feature extraction process
  • broke up the wikipedia dump and parsed it, producing ccgbank derivations and grs
  • parsed sections 2-21 of WSJ and retrained the supertagger on the parsed output
  • added features to the supertagger
    • extending the window to three words on either side rather than two
    • trigram tags (up to 3 tags both directions)
    • bigram and trigram words (up to 3 words both directions)
    • added flags for all extended features (all false by default except bigram tags up to 2 in both directions)
  • added MapReduce MPI implementation to supertagger training process

Stages of Parallelization of Supertagger Training

  • Stage One: no parallelization

  • Stage Two: Jono's MPI implementation of weight estimation

  • Stage Three: Jono & Jessi's python script for parallelizing feature extraction

  • Stage Four: James & Jessi's MapReduce MPI implementaiton of entire training process

  • Stage Five: complete parallelization, including initial training data

Evaluating the Parser

  • to evaluate parser output (grs):
    • run the output of the parser through the post-processing script (in candc/candc-1.00/src/scripts/ccg/grs2depbank)
    • run evalParse.pl (attached)
      • usage: perl evalParse.pl <TEST_GRS> <GOLD_GRS>
      • gold grs attached (Steve's 300 annotated sentences)

Experiments

  • retraining the POS tagger on WSJ plus 60 gold standard Wikipedia sentences from Steve's annoated data, duplicated 10 times and added to WSJ 02-21
    • evaluated on Wikipedia data
    • results
      • trained on WSJ only: 97.87%
      • trained on WSJ + Wikipedia: 97.68%
  • retraining the supertagger on WSJ plus 60 gold standard Wikipedia sentences, duplicated 10 times and added to WSJ 02-21
training Gold POS tags Auto POS tags
WSJ only 91.05% 89.95%
WSJ+Wiki 91.26% 90.29%
  • evaluating the parser using the newly trained POS tagger and supertagger models
training Gold POS tags Auto POS tags
WSJ only P = 82.27; R = 82.72; F = 82.48 P = 81.01; R = 80.85; F = 80.90
WSJ+wiki P = 82.21; R = 82.82; F = 82.50 P = 80.15; R = 82.18; F = 81.15

Attachments

  • jhu-undergrad-jessi.pdf (488.8 kB) -Jessi's presentation on MapReduce for undergrad presentation, added by jessi on 07/17/09 05:35:58.
  • process1.png (17.3 kB) - added by jessi on 07/17/09 06:24:35.
  • process2.png (27.9 kB) - added by jessi on 07/17/09 06:27:35.
  • process3.png (34.9 kB) - added by jessi on 07/17/09 06:28:18.
  • process4.png (44.3 kB) - added by jessi on 07/17/09 06:28:32.
  • process5.png (50.2 kB) - added by jessi on 07/17/09 07:06:58.
  • evalParse.pl (6.0 kB) -Script for evaluating parser GRs output, added by jessi on 07/24/09 04:22:04.
  • sent300.bandc.grs.corrected (257.6 kB) -300 gold standard GRs annotated by Steve, added by jessi on 07/24/09 04:22:54.