2009/07/02

Sanity test

sanity test:

training:




1 a a a CD_A CD_A _ _ 0 0 NMOD_A NMOD_A Y A _
2 b b b DT_B RBR_B _ _ 1 1 NMOD_B NMOD_B _ _ l1



1 d d d CD_D CD_D _ _ 0 0 NMOD_D NMOD_D Y D _
2 c c c DT_C RBR_C _ _ 1 1 NMOD_C NMOD_C _ _ l2
3 e e e DT_E RBR_E _ _ 1 1 NMOD_E NMOD_E _ _ l3



testing:








1 d d d CD_D CD_D _ _ 0 0 NMOD_D NMOD_D Y D _
2 e e e DT_E RBR_E _ _ 1 1 NMOD_E NMOD_E _ _ l3










1 a a a CD_A CD_A _ _ 0 0 NMOD_A NMOD_A Y A _
2 b b b DT_B RBR_B _ _ 1 1 NMOD_B NMOD_B _ _ l2
3 e e e DT_E RBR_E _ _ 1 1 NMOD_E NMOD_E _ _ l3




system output:

1 d d _ CD_D _ _ _ 0 0 NMOD_D _ Y D _
2 e e _ DT_E _ _ _ 1 1 NMOD_E _ _ _ l3

1 a a _ CD_A _ _ _ 0 0 NMOD_A _ Y A _
2 b b _ DT_B _ _ _ 1 1 NMOD_B _ _ _ l1
3 e e _ DT_E _ _ _ 1 1 NMOD_E _ _ _ l3

results:
correct = 2 wrong = 1
as I expected

5 hours to execute English data set.

I found doubles spaces in the training set, that is why input and output files have different amount of lines.

No comments:

Post a Comment