NLTK chunking and walking the results tree

I'm using the NLTK RegexpParser to extract noun groups and verb groups from tagged tokens.

How do I walk the resulting tree to find only the chunks that are NP or V groups?

from nltk.chunk import RegexpParser 

grammar = ''' 
NP: {<DT>?<JJ>*<NN>*} 
V: {<V.*>}''' 
chunker = RegexpParser(grammar) 
token = [] ## Some tokens from my POS tagger 
chunked = chunker.parse(tokens) 
print(chunked)

# How do I walk the tree?
# for chunk in chunked:
#     if chunk.??? == 'NP':
#         print(chunk)

(S (NP carrier/NN) for/IN tissue-/JJ and/CC cell-culture/JJ for/IN (NP the/DT preparation/NN) of/IN (NP implants/NNS) and/CC (NP implant/NN) (V containing/VBG) (NP the/DT carrier/NN) ./.)

2011-10-01 Vincent Theeten

This should work:

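import nltk

for n in chunked:
    # Chunks are Tree objects; unchunked tokens are plain (word, tag) tuples.
    # NLTK 3 uses n.label(); NLTK 2 used the n.node attribute instead.
    if isinstance(n, nltk.tree.Tree):
        if n.label() == 'NP':  # use n.label() in ('NP', 'V') to match both
            do_something_with_subtree(n)
    else:
        do_something_with_leaf(n)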
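
To make the loop above concrete, here is a minimal end-to-end sketch. The hand-tagged sentence, the helper name collect_chunks, and the NP rule (tightened to <NN.*>+ so that every NP contains at least one noun) are my own assumptions for illustration and are not taken from the question; label() assumes NLTK 3.

```python
from nltk.chunk import RegexpParser
from nltk.tree import Tree

# Assumption: a slightly tightened variant of the question's grammar.
grammar = r'''
NP: {<DT>?<JJ>*<NN.*>+}
V: {<V.*>}'''


def collect_chunks(tree, labels=("NP", "V")):
    """Return (label, text) pairs for top-level chunks with a wanted label."""
    found = []
    for node in tree:
        # Chunks are Tree objects; unchunked tokens stay (word, tag) tuples.
        if isinstance(node, Tree) and node.label() in labels:
            words = " ".join(word for word, tag in node.leaves())
            found.append((node.label(), words))
    return found


# Hypothetical POS-tagged input, written by hand so no tagger is needed.
tagged = [
    ("the", "DT"), ("small", "JJ"), ("carrier", "NN"),
    ("contains", "VBZ"),
    ("the", "DT"), ("implant", "NN"),
    (".", "."),
]

chunked = RegexpParser(grammar).parse(tagged)
chunks = collect_chunks(chunked)
for label, words in chunks:
    print(label, words)
```

The isinstance check is what avoids an AttributeError on the plain tuples: only Tree nodes have a label.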

2011-10-01 09:31:03

That gives me AttributeError: 'tuple' object has no attribute 'node'. n is –

Edited the answer ... –

Works like a charm - thanks! –

A small mistake in the variable name token:

from nltk.chunk import RegexpParser

grammar = '''
NP: {<DT>?<JJ>*<NN>*}
V: {<V.*>}'''
chunker = RegexpParser(grammar)
token = []  # Some tokens from my POS tagger
# The list is named token, but the question passed tokens to the parser:
# chunked = chunker.parse(tokens)
chunked = chunker.parse(token)  # changed: use the name defined above
print(chunked)

2012-08-03 09:41:50 Wazzzy

Savino's answer is great, but it's also worth noting that subtrees can be accessed by index as well, for example:

for n in range(len(chunked)): 
    do_something_with_subtree(chunked[n])
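
As a small sketch of index access, and of an alternative that skips the manual type check: Tree.subtrees() accepts a filter function and yields only matching subtrees, at any depth. The tree below is built by hand purely for illustration (it is not output from the question), and label() assumes NLTK 3.

```python
from nltk.tree import Tree

# Hypothetical chunk tree, constructed directly so no parser or tagger runs.
chunked = Tree("S", [
    Tree("NP", [("the", "DT"), ("carrier", "NN")]),
    Tree("V", [("contains", "VBZ")]),
    Tree("NP", [("the", "DT"), ("implant", "NN")]),
    (".", "."),
])

# Children can be read by position, like items of a list.
first = chunked[0]                 # a Tree labelled NP
last = chunked[len(chunked) - 1]   # a plain (word, tag) tuple

# subtrees() only yields Tree objects, so leaves never reach the filter.
wanted = list(chunked.subtrees(filter=lambda t: t.label() in ("NP", "V")))
wanted_labels = [t.label() for t in wanted]

for subtree in wanted:
    print(subtree.label(), subtree.leaves())
```

Note that indexing returns leaves as well as subtrees, so code that calls label() on chunked[n] still needs the isinstance check from the first answer.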

2014-01-15 14:57:09 TheKevJames
