Solr:Text analysis - 代码天地

Solr:Text analysis

企业开发 2018-05-12 16:54:03 阅读次数: 0

------------------------------------------------------------------------------------------------------------------------------------

Defining a custom field type for microblog text

------------------------------------------------------------------------------------------------------------------------------------

Advanced text analysis

How do you select the right text analyzer during indexing? Assuming you want to index all your documents regardless of language in the same index, a simple solution would be to use a unique field for each language. Suppose we want to index French tweets in our microblog search application. We could define the following field:

<field name="text_fr" type="text_microblog_fr"
indexed="true" stored="true" />

-------------------------------------------------------------------------------------------------------------------------------------

Integrate jcseg with solr to deal with chinese tokenizer

1.cp jcseg-core-1.9.5.jar and jcseg-solr-1.9.5.jar to solr-4.7.0/example/solr-webapp/webapp/WEB-INF/lib/

2. cp lexicon dir to solr-4.7.0/example/solr-webapp/webapp/WEB-INF/lib/

3. alter schema.xml add fildtype

<fieldtype name="textComplex" class="solr.TextField">
   <analyzer>
       <tokenizer class="org.lionsoul.jcseg.solr.JcsegTokenizerFactory" mode="complex"/>
       <filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_ch.txt"/>
    </analyzer>
</fieldtype>

猜你喜欢

转载自ylzhj02.iteye.com/blog/2089976

Solr:Text analysis

solr analysis页面分析

NLP-Text Classifiers for Sentiment Analysis

深入理解Elasticsearch专题：Text Analysis

【Solr】Schema.xml and solrconfig.xml analysis

solr 4 分词报错 This Functionality requires the /analysis/field Handler to be regist

第七章分词器：Text Analysis

中文文本分析, Text-Analysis

(Network Analysis)Link Analysis

Codeforces Round #375 (Div. 2) B - Text Document Analysis 模拟

论文笔记：Digital Watermarking Technique for Text Document Protection Using Data Mining Analysis

阅读笔记001.《Parameter estimation for text analysis》- Gregor Heinrich 论文笔记

Solving Aspect Category Sentiment Analysis as a Text Generation Task论文阅读（EMNLP2021）

Regression Analysis

Analysis Patterns

Error Analysis

Analysis of Algorithms

Procrustes analysis

Analysis method

video analysis

Numerical Analysis

Project Analysis

Analysis servlet

Servlet Analysis

Analysis of Servlet

CDI Analysis

Analysis CDI

Log Analysis

解析-analysis

algorithmic analysis

今日推荐

美国拟限制 AI 大模型出口中国和俄罗斯

苹果将与 OpenAI 达成协议，将 ChatGPT 应用于 iPhone

openKylin 社区生态委员会第六次会议圆满召开

阿里云正式发布通义千问 2.5

Python 3.13 发布首个 Beta：实验性自由线程模式和 JIT、改进交互式解释器

Stack Overflow 拿我的代码去训练 AI 大模型，还封了我的账号

Pop!_OS 的 COSMIC 桌面完成 App Store 上架工作

报告：Django 仍然是 74% 开发者的首选

《2024 年一季度互联网投融资运行情况》研究报告

15 年前上了“FFmpeg 耻辱柱”，今天他还得谢谢咱——腾讯QQPlayer一雪前耻？

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

周排行

curl的POST请求，封装方法

8.1.1. Integer Types

Java基础 Day05(个人复习整理)

Python - Django - 中间件 process_exception

小L的试卷

【Shell编程】（函数）判断用户是否存在

python(css样式)

spring ant path 匹配原则 - 【笔记】

《JavaScript与JScript从入门到精通》(美)James.Jaworski.中译本.扫描版.pdf

Eclipse运行带参数的java程序

每日归档

更多

2024-05-12(0)

2024-05-11(38)

2024-05-10(38)

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)