Initial run

(Suzy's system except for gathering process)

Query Good Bad Both wrong Bad sentence Not sure
Alexander Graham Bell invented telephone 7 5 3 2 0

Comments

paused to collect results properly

Problems

  • Alexander Graham Bell was a Scot who invented the telephone . (i.e. rel clause, tries to hang it off Bell rather than Scot)
    • need to resolve anaphora? Or excluse who/whose/he/him/her/his sentences
  • remove ':' sentences and '?'s? (But ':' seem also to involve many improvements...)
  • error applying constraint to 'In short , it will be the greatest thing since Alexander Graham Bell invented the telephone in the 19th century .' (sim to Edmund Barton problem)

Experiment 1

Question: does it improve general performance?

Gen method:

  • automatically extract queries from freebase for (composer, bday), (inventor, invention), (director, film)
  • get 'top 40' (based on yahoo hit counts), and manually winnow 'bad' ones
  • extract constraints that are shared across queries, only from sentences < 15 tokens and 1 to 2 verbs (have to have two verbs, otherwise miss some nice ones).
  • apply nice constraints (e.g. containing a word >= 3 chars to avoid punc) that have 'maximal' match (e.g. if we have a word match on 4 constraints, ignore 3 constraint options) to all sentences; general our 'nice' output

Eval:

  • Train models on CCG bank subset +
    • nothing
    • constrained sentences
    • unconstrained sentences (i.e. a lot more data than constrained sentences)
  • Eval on Steve's wikipedia sentences + standard WSJ section

Results: ?

Experiment 2