Location: lucene-solr/lucene/analysis
Structure of the analysis module

Directories to focus on:
common/src/java
opennlp/src/java
smartcn/src/java
Analysis README file
INTRODUCTION
The Analysis Module provides analysis capabilities to Lucene and Solr
applications.
The Lucene web site is at:
http://lucene.apache.org/
Please join the Lucene-User mailing list by sending a message to:
java-user-subscribe@lucene.apache.org
FILES
lucene-analyzers-common-XX.jar
The primary analysis module library, containing general-purpose analysis
components and support for various languages.
lucene-analyzers-icu-XX.jar
An add-on analysis library that provides improved Unicode support via
International Components for Unicode (ICU). Note: this module depends on
the ICU4J jar file (version >= 4.6.0).
lucene-analyzers-kuromoji-XX.jar
An analyzer with morphological analysis for Japanese.
lucene-analyzers-morfologik-XX.jar
An analyzer using the Morfologik stemming library.
lucene-analyzers-nori-XX.jar
An analyzer with morphological analysis for Korean.
lucene-analyzers-opennlp-XX.jar
An analyzer using the OpenNLP natural-language processing library.
lucene-analyzers-phonetic-XX.jar
An add-on analysis library that provides phonetic encoders via Apache
Commons-Codec. Note: this module depends on the commons-codec jar
file.
lucene-analyzers-smartcn-XX.jar
An add-on analysis library that provides word segmentation for Simplified
Chinese.
lucene-analyzers-stempel-XX.jar
An add-on analysis library that contains a universal algorithmic stemmer,
including tables for the Polish language.
common/src/java
icu/src/java
kuromoji/src/java
morfologik/src/java
nori/src/java
opennlp/src/java
phonetic/src/java
smartcn/src/java
stempel/src/java
The source code for the libraries.
common/src/test
icu/src/test
kuromoji/src/test
morfologik/src/test
nori/src/test
opennlp/src/test
phonetic/src/test
smartcn/src/test
stempel/src/test
Unit tests for the libraries.
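
The listing above follows a regular naming pattern: each module name appears in its jar file name, its source directory, and its test directory. The sketch below only restates that pattern in plain Java. The class name `AnalysisModuleLayout` and its helper methods are hypothetical and are not part of Lucene; the `XX` version placeholder is kept exactly as the README writes it.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper (not a Lucene class): derives the jar name and
// directory paths for each analysis module named in the FILES listing.
public class AnalysisModuleLayout {

    // Module names copied from the README's FILES section.
    static final List<String> MODULES = Arrays.asList(
            "common", "icu", "kuromoji", "morfologik", "nori",
            "opennlp", "phonetic", "smartcn", "stempel");

    // Jar name as listed in the README; "XX" is the README's version placeholder.
    static String jarName(String module) {
        return "lucene-analyzers-" + module + "-XX.jar";
    }

    // Source directory, relative to lucene-solr/lucene/analysis.
    static String sourceDir(String module) {
        return module + "/src/java";
    }

    // Unit test directory, relative to lucene-solr/lucene/analysis.
    static String testDir(String module) {
        return module + "/src/test";
    }

    public static void main(String[] args) {
        // Print one row per module: jar, source directory, test directory.
        for (String module : MODULES) {
            System.out.println(jarName(module) + "  "
                    + sourceDir(module) + "  " + testDir(module));
        }
    }
}
```

For the three directories singled out at the top of these notes (common, opennlp, smartcn), the helpers return the same paths given there, for example `sourceDir("smartcn")` gives `smartcn/src/java`.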