[lucene] lucene忽略的一些英文单词
liubingjian
2012-03-19
使用中发现lucene忽略的一些英文单词,使用这些单词,查不到查询结果,例如:一些介词(on、in、of等),一些冠词(a、the等),谁知道有多少这样的词会被忽略,或者哪里能查得到。谢谢
|
|
wu_quanyin
2012-03-26
lucene里面有一个停词机制,,,把那些常用的词进行忽略,,,如果用IKAnalyzer可以在stopWord.txt中进行设置。。
|
|
sanzangc_sdn
2012-03-30
在定义含有stopWords分词器的时候都会指定stopWords,如果没有指定可以引用默认的stopWords,在StandardAnalyzer、StopAnalyzer和ClassicAnalyzer分词器中stopWords是
"a", "an", "and", "are", "as", "at", "be", "but", "by", "for", "if", "in", "into", "is", "it", "no", "not", "of", "on", "or", "such", "that", "the", "their", "then", "there", "these", "they", "this", "to", "was", "will", "with" |