es 啟動需要分配超過2.6g的內(nèi)存
默認端口9200,9300 ;
9200為http端口,9300為tcp端口
springboot-data-es 默認連接9300進行操作 存在問題:容易因為es版本不一致而無法啟動
所以選擇使用在項目中使用http訪問
searchbox 發(fā)送http請求的client
elasticsearch 主要用于寫es查詢體
lucene-core es的核心包,當啟動es后訪問9200會出現(xiàn)該包的版本號
引入pom
<dependency>
<groupId>io.searchbox</groupId>
<artifactId>jest</artifactId>
<version>5.3.3</version>
</dependency>
<dependency>
<groupId>org.elasticsearch</groupId>
<artifactId>elasticsearch</artifactId>
<version>5.6.16</version>
</dependency>
<!-- https://mvnrepository.com/artifact/org.apache.lucene/lucene-core -->
<dependency>
<groupId>org.apache.lucene</groupId>
<artifactId>lucene-core</artifactId>
<version>6.6.1</version>
</dependency>
yml 配置
spring:
elasticsearch:
jest:
uris: http://localhost:9200
read-timeout: 20000 #讀取超時
connection-timeout: 20000 #連接超時
具體使用
創(chuàng)建entity
public class EntityDo implements Serializable {
//庫名
public static final String INDEX_NAME = "test";
//表名
public static final String TYPE = "entity";
private Integer id;
private String workerid;
private String content;
}
service
@Service
public class CaseService {
@Autowired
private JestClient jestClient;
//批量插入
public void saveEntity(List<EntityDo> EntityDos) {
Bulk.Builder bulk = new Bulk.Builder();
for(EntityDo entityDo: EntityDos) {
Index index = new Index.Builder(entityDo).index(EntityDo.INDEX_NAME).type(CaseDo.TYPE).build();
bulk.addAction(index);
}
try {
jestClient.execute(bulk.build());
} catch (IOException e) {
e.printStackTrace();
}
}
public List<String> searchFetch(String content){
//構(gòu)造查詢體
SearchSourceBuilder searchSourceBuilder = new SearchSourceBuilder();
//匹配查詢content 最小匹配度75%
searchSourceBuilder.query(QueryBuilders.matchQuery("content",content).minimumShouldMatch("75%"));
//聚合查詢 將workerid作為主體 分組查詢workerid出現(xiàn)最多的10條數(shù)據(jù)
TermsAggregationBuilder aggregationBuilder = AggregationBuilders.terms("workerid_count").field("workerid.keyword").size(10);
//將聚合查詢加入查詢體中
searchSourceBuilder.aggregation(aggregationBuilder);
//根據(jù)查詢體,庫名表名 創(chuàng)建查詢
Search search = new Search.Builder(searchSourceBuilder.toString())
.addIndex(EntityDo.INDEX_NAME).addType(EntityDo.TYPE).build();
List<String> workids = new ArrayList<>();
try {
//發(fā)送請求
JestResult result = jestClient.execute(search);
//請求成功
if (result.isSucceeded()){
//從Agg中獲取聚合查詢中的結(jié)果
List<TermsAggregation.Entry> workerid_counts = ((SearchResult) result).getAggregations().getTermsAggregation("workerid_count").getBuckets();
for (TermsAggregation.Entry entry: workerid_counts
) {
workids.add(entry.getKeyAsString());
}
}
} catch (IOException e) {
e.printStackTrace();
}
return workids;
}
踩過的坑
關(guān)于hit與agg
es的查詢,如果帶有聚合查詢就會返回帶有agg的結(jié)果,通過遍歷獲取agg的內(nèi)容即可獲得聚合值
es的查詢無論是普通的匹配查詢還是聚合查詢 都會帶有hit值,hit表示所有滿足查詢條件的結(jié)果,里面不是聚合后的結(jié)果?。。?!
之前不理解這2個關(guān)系,所以在java代碼中聚合了hit以獲得正確的解;但是通過驗證agg中的值,結(jié)合es是用java寫的,而且es的返回結(jié)果無論如何都帶有hit數(shù)據(jù),所以我認為我們發(fā)送的聚合查詢的本質(zhì)就是,es先查詢出hit值,然后用java代碼實現(xiàn)聚合后,將值加入agg后返回