Solr6.3 集成pinyin4j 分词插件

1、前期准备,需要用到pinyin4j-2.5.0.jar、pinyin4j-analyzers-4.3.1.jar这两个jar包

2、将pinyin4j-2.5.0.jar、pinyin4j-analyzers-4.3.1.jar这两个jar包复制到apache-tomcat-8.0.47-windows-x64\apache-tomcat-8.0.47\webapps\solr\WEB-INF\lib目录下。

3、在 C:\solr_config\fileInfo\conf\ managed-schema文件</schema>前增加如下配置:

<!-- ik分词 + pingying4j分词 -->
	<fieldType name="text" class="solr.TextField">
		<analyzer type="index">
			<tokenizer class="org.wltea.analyzer.lucene.IKTokenizerFactory" isMaxWordLength="false" useSmart="false" conf="ik.conf"/>
			<filter class="solr.LowerCaseFilterFactory" />
			<filter class="com.shentong.search.analyzers.PinyinTransformTokenFilterFactory" minTermLenght="2" />
			<filter class="com.shentong.search.analyzers.PinyinNGramTokenFilterFactory" minGram="1" maxGram="20" /> 
		</analyzer>
		<analyzer type="query">
			<filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true" />
			<tokenizer class="org.wltea.analyzer.lucene.IKTokenizerFactory" isMaxWordLength="false" useSmart="false" conf="ik.conf"/>  
			<filter class="solr.LowerCaseFilterFactory" />
		</analyzer>  
	</fieldType>
	

重启tomcat查看拼音检索效果。

发布了1266 篇原创文章 · 获赞 275 · 访问量 290万+

猜你喜欢

转载自blog.csdn.net/zhouzhiwengang/article/details/103514471