
Originally Posted by
wysota
Follow the link I gave you and scroll down.
Sorry, I saw it; I don't know how to use it; so I thought to put it inside the java code on this way:
List<String> stopwords = new ArrayList<String>(Arrays.asList("a", "about",................);
List<String> stopwords = new ArrayList<String>(Arrays.asList("a", "about",................);
To copy to clipboard, switch view to plain text mode
Is it too much ugly?
What I need with lemmatization should be an hint about something very fast to use; is there any Java standard library? I didn't find it and it seems odd to me...
thanks,
EDIT:
one more thing not claer: I removed the stop word while reading them from file; but I know that there's a technicque that remove the words with highest frequency; stop-word and this last technique are the same thing?
Bookmarks