elasticsearch-cn / elasticsearch-definitive-guide Goto Github PK

View Code? Open in Web Editor NEW

This project forked from elastic/elasticsearch-definitive-guide

4.8K 4.8K 1.5K 8.76 MB

欢迎加QQ群：109764489，贡献力量！

Home Page: https://www.elastic.co/guide/cn/elasticsearch/guide/current/index.html

License: Other

Perl 15.67% HTML 53.93% CSS 19.95% XSLT 10.45%

elasticsearch-definitive-guide's People

Contributors

$polyfractal avatar$

Stargazers

Watchers

Forkers

wanghaisheng hysios 756613351 evanyellow lxy4java abia321 newlifehejian lywangwenbin zcola birdben thanq pengqiuyuan medcl songlinx sunyonggang ggchangan xlows-1227 rainandlzm luotitan richardwei2008 biyuhao mythslove lishuaigit chenryn yufeng9006 xuej gaodoo limjoe jayranbutong dajyaretakuya rucky2013 weiqiangyuan xiaoguanyu maplecms jessicawon chilly gitqh davidmr001 bsll donglangdtstack dreamer2008 wangxiuwen alexlove77 fanyer calm4wei yichao2015 leo650 trestea josephjin tangmisi lephix miranda21 kayangredhat kaichuang1992 blogsit weikuo0506 dongcheng javasgl josephjinshoajian liugangr echolihao feuyeux pythonshell kankedong wharstr9027 angryz sdlyjzh stormdush michealzh qindongliang candythinking wangqi811 smilesfc exceptions alixmu fredlliao linenlin01 wypb yuanfeiasima xhalower node lldaaron hamiltonisbest bluerocly chuanchang mslycn pkusnail williamdeve fly365 smartan jaychang9 zhijieqing czjxy881 ronyuzhang ipsolar zhangxiaoguang-baidu bsglz bbirdsky cham-space citysir

elasticsearch-definitive-guide's Issues

es5.2根据请求来源端口缓存数据？

环境 : 系统 linux
jdk 1.8
es 5.2
tomcat 6.0
maven 3.3.9
JAVA_OPTS="-Dcom.sun.management.jmxremote.port=xxx -Dcom.sun.management.jmxremote.ssl=false -Dcom.sun.management.jmxremote.authenticate=false -Djava.rmi.server.hostname=xxx.xxx.xxx.xxx -Djava.awt.headless=true -Djava.library.path=/home/elasticsearch/jdk1.8/jre/bin -Djmagick.systemclassloader=no -server -Xmx3072m -Xms3072m -XX:NewSize=768m -XX:MaxNewSize=768m -Xss512k -XX:+UseConcMarkSweepGC -XX:+CMSIncrementalMode -XX:+CMSIncrementalPacing -XX:+UseCMSCompactAtFullCollection -XX:CMSIncrementalDutyCycleMin=0 -XX:CMSIncrementalDutyCycle=10 -XX:PermSize=128m -XX:MaxPermSize=128m -XX:+PrintGCDetails -XX:+PrintGCTimeStamps -XX:-TraceClassUnloading";

   同样的代码 同一个tomcat,通过不同的server.xml启动两次，只修改了server.xml中的端口号 ，包括shutdown http ajp redirect 四个端口，其它tomcat配置文件均未修改。

问题：两个端口从es中拿到的数据不一致。
开始我的tomcat用的是80端口,调试时发现页面数据缺失,检查后发现是从es中拿数据的逻辑发生错误，修改后部署，线上页面的数据仍然是老数据，没有更新。

已排除的可能性
1. ngnix
项目配置了ngnix，但是我通过直接访问服务器的ip+port/project的形式是不走ngnix的，数据仍然是老数据，排除ngnix缓存可能。
2. redis
开始我的程序是加了redis缓存的，后来为了检查这个问题，已经去掉了从es拿数据这部分缓存，重新部署，数据仍然是老数据。我将服务器中跑的项目的class文件down到本地，反编译后，内容中确实已经去掉了redis缓存。
3. tomcat
删除了work目录下所有文件，页面仍然是老数据
4. es我了解到的缓存
http://localhost:9200/twitter/_cache/clear
http://localhost:9200/twitter/_cache/clear?filter=true
http://localhost:9200/twitter/_cache/clear?filter=true&fields=cphm,cplx
以上三种都试过，执行成功但页面仍然是老数据
5.不是项目本身的问题
从服务器上拿到tomcat运行的war包，放在本地执行均为正确的新数据
切换tomcat端口，仅修改了server.xml中端口，然后启动同一个tomcat,页面显示数据为正确的新数据。
6.重启过es服务器
旧端口仍然是旧数据

  以上，我调试阶段是在本地调试，本地都是最新的数据。那么缓存究竟出在什么地方，是es根据请求来源做了缓存，并且落地了，还是有可能什么其它问题。

  我在程序中打印了从es拿到的数据，发现不同端口启动的tomcat，日志内容也是不一样的，旧端口的日志显示从es拿到的数据就是缺失的。

  ----------------------------------------------------------------------------
  我重新配置了tomcat ，测试了旧端口，仍然拿不到数据，排除了tomcat配置原因，缓存究竟出在哪。

we need a README file

管理监控部署章节拼写错误

部署章节的“重要配置修改部分”最小主节点数里边的 data 拼写成了 date

45_Partial_update_错误

这个例子执行报错。

POST /website/blog/1/_update
{
   "script" : "ctx.op = ctx._source.views == count ? 'delete' : 'none'",
    "params" : {
        "count": 1
    }
}

我改成这样OK了：

POST /website/blog/1/_update
{
   "script" : "ctx.op = ctx._source.views == count ? \"delete\" : \"none\"",
    "params" : {
        "count": 1
    }
}

写文档请确保示例都是可以运行的。

ES安装plugin插件命令改变

Windows环境下es 5.6.2版本中, 安装插件的命令是 kibana-plugin.bat, 请在以下文档中做更新 install sense

bin\kibana-plugin.bat install elastic/sense

求解？elasticsearch 9300连接不上 java，已经花费几天时间

环境：客户服务器
Windows Server 2012 R2/6.3/amd64
elasticsearch6.1.2
jdk 1.8.0.2
后台程序无异常，@2打印正常，@3不打印
System.out.println("@2"+ip+":"+port);
			this.client = new PreBuiltTransportClient(settings).addTransportAddress(new TransportAddress(InetAddress.getByName(ip), port));
			System.out.println("@3"+client.nodeName());
以上问题弄了很久不得其解，各种测扔没找到问题求各位解答，谢了
本机上装的es同样的环境win7 一切正常。

RestHighLevelClient 连接elasticsearch需要token

ElasticsearchStatusException[Elasticsearch exception [type=security_exception, reason=missing authentication token for REST request [/]]]

请问，应该怎么把用户名和密码添加进去

使用curl调用api需要加Content-Type头

Elasticsearch: 权威指南 » 基础入门 » 你知道的, 为了搜索… » 和 Elasticsearch 交互

curl示例中，需要指定Content-Type。

curl -H 'Content-Type: application/json' -X GET 'http://localhost:9200/_count?pretty' -d '
{
    "query": {
        "match_all": {}
    }
}
'

翻译纠错

数据建模-通过子文档查询父文档, 最后：

has_child 查询和过滤在运行机制上类似，区别是 has_child 过滤不支持 source_mode 参数。has_child 过滤仅用于筛选内容--如内部的一个 filtered 查询--和其他过滤行为类似：包含或者排除，但没有进行评分。
has_child 过滤的结果没有被缓存，但是 has_child 过滤内部的过滤方法适用于通常的缓存规则。

source_mode应该是score_mode

在数据建模-通过父文档查询子文档

虽然 nested 查询只能返回最顶层的文档，但是父文档和子文档本身是彼此独立并且可被单独查询的。我们使用 has_child 语句可以基于子文档来查询父文档，使用 has_parent 语句可以基于子文档来查询父文档。

”使用 has_parent 语句可以基于子文档来查询父文档“这句话应该改为“使用 has_parent 语句可以基于父文档来查询子文档”

一处错字

文件：510_Deployment/40_config.asciidoc
错误地方：
但它应该永远不被使用在生产环境了，否在你得到的结果就是一个节点意外的加入到了你的生产环境，仅仅是因为他们收到了一个错误的组播信号。其中否在应该是否则

我想自己把该项目转出PDF该怎么做？

我想转出PDF来看，虽然我知道官网里有，但是最新的毕竟是源码库。本人只用于学习，目前正在学习elasticsearch，自己也翻译版本5的文档，当然没有你们翻译的好

mget如何查看took时间

mget查询doc，在结果中没有显示took时间，而search是可以获取该时间，用于定位问题，mget是否支持

映射和分析这一篇开头的例子好像已经不能用了

使用了如下例子进行模拟, 根本查询不出来

POST /gb/tweet/_bulk
{ "index": { "_id": 1 }}
{ "name" : "高性能MySQL", "user_id" : "15", "tweet" : "page's number is gt than 700" , "create_at":"2015-09-25"}
{ "index": { "_id": 2 }}
{ "name" : "Elasticsearch 权威指南", "user_id" : "5", "tweet" : "this is a very nice book" , "create_at":"2018-02-11"}
{ "index": { "_id": 3 }}
{ "name" : "Solr 实战", "user_id" : "1098", "tweet" : "my next book" , "create_at":"2017-06-06"}
{ "index": { "_id": 4 }}
{ "name" : "Kafka 内核分析", "user_id" : "79", "tweet" : "kafka need to compare with rabbitmq" , "create_at":"2016-08-06"}

如下查询貌似没有结果, 感觉 _all 已经变了, 我用的是 est6

GET /gb/tweet/_search?q=2015

网上查过了, 有人说est6需要使用 copy_to, 但依然无效

cancel

potential return value change in XHEAD operation

030_Data/20_Exists.asciidoc

in this chapter ,when check the document is exist , send
curl -i -XHEAD http://localhost:9200/website/blog/123

the elastic search returns different with the result offered in doc.
as
HTTP/1.1 200 OK
content-type: application/json; charset=UTF-8
content-length: 185

Notice that content-length is not zero and the request doesn't end properly , Ctrl+C is needed to exit the request.

unexpected error during fresh start by docs

010_Intro/35_Tutorial_Aggregations.asciidoc

here when excute the example:

{
"aggs": {
"all_interests": {
"terms": { "field": "interests" }
}
}
}
'

elastic search returned :
{
"error" : {
"root_cause" : [
{
"type" : "illegal_argument_exception",
"reason" : "Fielddata is disabled on text fields by default. Set fielddata=true on [interests] in order to load fielddata in memory by uninverting the inverted index. Note that this can however use significant memory. Alternatively use a keyword field instead."
}
],
"type" : "search_phase_execution_exception",
"reason" : "all shards failed",
"phase" : "query",
"grouped" : true,
"failed_shards" : [
{
"shard" : 0,
"index" : "megacorp",
"node" : "3uJClDOmRmKlDjKWYMsYRQ",
"reason" : {
"type" : "illegal_argument_exception",
"reason" : "Fielddata is disabled on text fields by default. Set fielddata=true on [interests] in order to load fielddata in memory by uninverting the inverted index. Note that this can however use significant memory. Alternatively use a keyword field instead."
}
}
]
},
"status" : 400
}

i think it is caused by some version change in default settings , so the doc needs improvement , add corresponding settings before this chapter :)

请问有PDF下载吗

请问有PDF下载吗,

中文版里面的搜索示例代码过时了

比如： ./080_Structured_Search/10_compoundfilters.asciidoc 文件里，关于filtered使用已经在5.0版本过时了。

参考链接： https://stackoverflow.com/questions/40519806/no-query-registered-for-filtered

使用HighLevelClient搜索ES服务器，用Jmeter压测200个线程时响应很慢，不支持高并发么？

用Jmeter压测时，每个线程第一次请求都会耗时800ms左右，这样200个线程就需要等待160s。第一次请求都慢，第二次请求就会很快响应，大概20ms，为什么会出现这种情况？我已经禁用了ping和嗅探，求指点。

使用分词器搜索，highlight里面搜索结果为什么有时候是片段，而不是整个字段内容

使用分词器搜索，highlight里面搜索结果为什么有时候是片段，而不是整个字段内容：
{
"took": 4,
"timed_out": false,
"_shards": {
"total": 5,
"successful": 5,
"failed": 0
},
"hits": {
"total": 1,
"max_score": 10.979511,
"hits": [
{
"_index": "ik-test",
"_type": "ik-test-doc",
"_id": "1501837223",
"_score": 10.979511,
"_source": {
"content": "豹纹陆龟(学名：Stigmochelys pardalis)，别名豹龟，属龟鳖目、陆龟科、豹龟属爬行动物。背甲长可达68厘米，头颈黄棕色无斑，前额鳞1-2枚，顶鳞为数枚小鳞，背甲为深浅相套的杂色，腋盾2枚，胯盾1枚，与股盾相接[1] 。雄性豹龟的体形比雌性龟大。曾经属于Geochelone属。栖息地为干燥草原及灌木丛，需要广阔的室内外活动空间。喜欢在半干燥、带荆棘的草原上生活。在炎热的季节里，它会夏眠；一般12-15年豹龟会达到性成熟。龟产蛋能力是很强的，一些母龟一季能产3窝或更多。已大量人工养殖，完全素食，但家庭饲养需定期补充钙粉及综合维生素。豹纹陆龟大多分布在非洲南部，亦广泛分布在撒哈拉以南非洲，但西非和中非的大部分地方是寻找不到它们的踪影的。"
},
"highlight": {
"content": [
"。雄性豹龟的体形比雌性龟大。曾经属于Geochelone属。栖息地为干燥草原及灌木丛，需要广阔的室内外活动空间。喜欢在半干燥、带荆棘的草原上生活。在炎热的季节里，它会夏眠；一般12-15年豹龟会达到性成熟。龟产蛋能力是很强的，一些"
]
}
}
]
}

has_parent 使用问题

使用has_parent时，是否如下呢：

        hasParentQuery := elastic.NewHasParentQuery(parentType, **)
	searchResult, err := elasticsearch.Client.Search().
		Index(childIndex).
		Type(childType).
		Query(hasParentQuery).
		Pretty(true).
		Do(**)

查询的索引为 child index 根据 parent 查询条件，获得相应的 child 数据？

清理下elastic-翻译小组群不活跃/or贡献少成员吧，给希望加入做贡献的人一些空间

如题，@medcl

示例中的curl -XHEAD命令不合理

elastic/elasticsearch#33490
使用curl -XHEAD调用，命令会不结束
参照上述issues,建议使用curl -Xhead或curl -HEAD

关于如何在本地开启的问题

这个项目是用gitbook做的吗？

我用gitbook serve运行，提示没有README.md文件，请问我要如何在本地搭起一本电子书？

因为网络不行，在网上看特别费劲，希望有人能够解答，谢谢！！

真的很需要一个README

请问能加入一个README提示一下，完全不知道应该怎么看这个文档。多谢

guide/080_30 use missing query,but missing query has been removed in 5.5

缺失查询中使用了missing

"filter": {
   "missing" : { "field" : "tags" }
  }

但是新版es5.5英文文档中显示已经移除了missing，应该使用must_not exists

教程里的聚合例子在elasticsearch 6.0.1下的写法

原文链接：https://www.elastic.co/guide/cn/elasticsearch/guide/current/_analytics.html

原例子：
GET /megacorp/employee/_search
{
"aggs": {
"all_interests": {
"terms": { "field": "interests" }
}
}
}

在elasticsearch 6.0.1下执行会报如下错误：
{
"error": {
"root_cause": [
{
"type": "illegal_argument_exception",
"reason": "Fielddata is disabled on text fields by default. Set fielddata=true on [interests] in order to load fielddata in memory by uninverting the inverted index. Note that this can however use significant memory. Alternatively use a keyword field instead."
}
],
"type": "search_phase_execution_exception",
"reason": "all shards failed",
"phase": "query",
"grouped": true,
"failed_shards": [
{
"shard": 0,
"index": "megacorp",
"node": "S_-UcyJiRGmZvt1vDN-HjA",
"reason": {
"type": "illegal_argument_exception",
"reason": "Fielddata is disabled on text fields by default. Set fielddata=true on [interests] in order to load fielddata in memory by uninverting the inverted index. Note that this can however use significant memory. Alternatively use a keyword field instead."
}
}
]
},
"status": 400
}

有两种方法可以解决，一种是设置fielddata=true，另一种是使用keyword，如下：
GET /megacorp/employee/_search
{
"aggs": {
"all_interests": {
"terms": {
"field": "interests.keyword"
}
}
}
}

官方文档：
https://www.elastic.co/guide/en/elasticsearch/reference/current/fielddata.html

Spring boot 2.0集成Elasticsearch

使用继承ElasticsearchRepository 连接外网的elasticsearch连不上报org.elasticsearch.client.transport.NoNodeAvailableException

图片为什么都挂了

很多图片都挂了啊看不了

请问如何编译成想要的格式

如题对asciidoc 不太熟悉请教

1.1.6搜索，使用DSL语句查询（GET错误）

1.1.6搜索，使用DSL语句查询
GET /megacorp/employee/_search
如果要传一个json请求体，不应该用GET方法，应该使用POST方法

CANNOT OPEN ALL THE PICS

110_Multi_Field_Search/30_Most_fields拼写错误

如果一个用户搜索 “quick brown box”应该是"fox"

請問能打包成成電子書的格式嗎?，如 mobi

請問有没有方法編譯成 mobi，使用kindle閱讀

看了英文版的介紹，只有打包成 html 的方法

query_string多条件检索说明有误

20_Query_string.asciidoc中说 “+”表示要同时满足检索条件

那么 q=age:25+gender:female 就应该检索出 age=25 并且 gender=female的文档。实际结果是age=25或者gender=female都检索出来了

steam的原理

我想了解下:
1.日志“tail -f”功能依赖filebeat，那么filebeat是一定要直接输出日志到ES吗？如果filebeat输出到logstash，由logstash输出到ES，“tail -f”能正常使用吗?
2.日志tail -f （stream）读取的数据是从ES读取，还是由filebeat实时上传？

when i run the code that given by https://gist.github.com/clintongormley/8579281 then i got this problem

{
"error": {
"root_cause": [
{
"type": "illegal_argument_exception",
"reason": "Rejecting mapping update to [gb] as the final mapping would have more than 1 type: [tweet, user]"
}
],
"type": "illegal_argument_exception",
"reason": "Rejecting mapping update to [gb] as the final mapping would have more than 1 type: [tweet, user]"
},
"status": 400
}

我用的是7.1.1版本语法不一样了吗？

GET /megacorp/employee/_search
{
"query" : {
"match" : {
"last_name" : "Smith"
}
}
}
这个不好使换成POST就好使了

翻译

取回一个文档????
翻译太low了吧

文档的部分更新（illegal_argument_exception）

POST /website/blog/1/_update
{
"script" : "ctx._source.tags+=new_tag",
"params" : {
"new_tag" : "search"
}
}
会报 illegal_argument_exception。
此处中的"script" : "ctx._source.tags+=new_tag",修改为"script" : "ctx._source.tags+=params.new_tag",
get以后得到的值为null。

通过实践，发现params中的所有变量都无法拿到。都是报illegal_argument_exception。

新版的文档，比起旧版来，好似没有一个地方可以查看目录啊

https://www.elastic.co/guide/cn/elasticsearch/guide/current/index.html

新版的文档比起以前gitbook的文档，没有地方来查看整个目录，在特定的查找时反而不方便了

关于精确查找

https://github.com/elasticsearch-cn/elasticsearch-definitive-guide/blob/cn/080_Structured_Search/20_contains.asciidoc
中提到的“{ "tags" : ["search", "open_source"], "tag_count" : 2 }”好像不能达到所说的精确匹配吧？因为terms查找是一个满足即可，所以上面的查询语句也可以匹配{ "tags" : ["search", "others"], "tag_count" : 2 }这种文档？