jeisner / treebank-scripts

Suite of scripts for preprocessing the Penn Treebank, primarily to extract lexical subcategorization frames and dependencies.
MIT License

problem with slashnulls #11

Open jeisner opened 8 years ago

jeisner commented 8 years ago

[item from the old TO-DO file dated 2002-04-07]

Problem with slashnulls: in the example below, the node label S~-1 has been changed to S~-1\NP~-SBJ-54, but the label is still passed up the tree as S~-1 (note the parent, VP+S~-1). The change needs to be applied consistently throughout the tree.

00/wsj_0037.mrg:131:    (S
                          (NP~-SBJ\NP~-SBJ-1
                            @(-NONE- @it))
                          @(VP+S~-1
                            @(VBZ @'s)
                            (ADVP @(RB @also))
                            (ADJP-PRD
                              @(VBG @refreshing))
                            (S~-1\NP~-SBJ-54
                              (NP~-SBJ\NP~-SBJ-54
                                @(-NONE- @0))
                              @(VP (TO @to)
                                   @(VP~ @(VB @read)
                                         (NP~
                                           @(NP (DT @a) (JJ @Japanese)
                                                @(NN @author))
                                           (SBAR
                                             @(WHNP-52
                                               @(WP @who))
                                             (S~\NP~-SBJ-52
                                               (NP~-SBJ\NP~-SBJ-52
                                                 @(-NONE- @0))
                                               (ADVP @(RB @clearly))
                                               @(VP (VBZ @does) (RB @n't)
                                                    @(VP~ @(VB @belong)
                                                          (PP-CLR
                                                            @(TO @to)
                                                            (NP~
                                                              @(NP (DT @the) (JJ @self-aggrandizing) (`` @``) (JJ @we-Japanese) ('' @'')
                                                                   @(NN @school))
                                                              (PP @(IN @of)
                                                                  (NP~
                                                                    @(NP @(NNS @writers))
                                                                    (SBAR
                                                                      @(WHNP-53
                                                                        @(WP @who))
                                                                      (S~\NP~-SBJ-53
                                                                        (NP~-SBJ\NP~-SBJ-53
                                                                          @(-NONE- @0))
                                                                        @(VP @(VBP @perpetuate)
                                                                             (NP~
                                                                               @(NP (DT @the)
                                                                                    @(NN @notion))
                                                                               (PP @(IN @of)
                                                                                   (NP~
                                                                                     @(NP (DT @the) (JJ @unique)
                                                                                          @(NN @Japanese))
                                                                                     (, @,)
                                                                                     (ADJP @(JJ @unfathomable)
                                                                                           (PP @(IN @by)
                                                                                               (NP~ @(NNS @outsiders)))))))))))))))))))))))
                          (. @.))
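
To illustrate the kind of consistency check and fix described above, here is a minimal Python sketch. It is not code from this repository. The helper names `stale_labels` and `propagate` are hypothetical, and the sketch assumes that a node label is the token right after an opening parenthesis and that `+` separates the components of a label such as VP+S~-1.

```python
import re

# Assumption: a node label is the first token after "(" in the bracketed tree.
LABEL_RE = re.compile(r"\(\s*([^\s()]+)")


def stale_labels(tree_text, old):
    """Return labels that still carry `old` as a whole '+'-separated component."""
    return [label for label in LABEL_RE.findall(tree_text)
            if old in label.split("+")]


def propagate(tree_text, old, new):
    """Rewrite every label component equal to `old` as `new` (hypothetical fix)."""
    def fix(match):
        # Keep the "(" and any whitespace before the label unchanged.
        prefix = match.group(0)[: match.start(1) - match.start(0)]
        parts = [new if part == old else part
                 for part in match.group(1).split("+")]
        return prefix + "+".join(parts)
    return LABEL_RE.sub(fix, tree_text)


# Abbreviated fragment of the tree from this issue.
sample = r"""(S
  @(VP+S~-1
    @(VBZ @'s)
    (S~-1\NP~-SBJ-54
      (NP~-SBJ\NP~-SBJ-54 @(-NONE- @0)))))"""

print(stale_labels(sample, "S~-1"))  # ['VP+S~-1']
fixed = propagate(sample, "S~-1", r"S~-1\NP~-SBJ-54")
print(stale_labels(fixed, "S~-1"))   # []
```

The already-slashed label S~-1\NP~-SBJ-54 is left alone because the comparison is on whole components, not substrings. Whether the parent should actually become VP+S~-1\NP~-SBJ-54 is an assumption about the intended fix.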