Elasticsearch version (bin/elasticsearch --version): 7.4.1
Plugins installed: ["mapper-size"]
JVM version (java -version): latest docker image.
OS version (uname -a if on a Unix-like system): latest docker image.
Description of the problem including expected versus actual behavior:
I'm doing an external reindex from Elasticsearch 5.6.16 to 7.4.1 and Its working great except for the one index that has 3 million documents in it and I'm randomly getting a parse exception on.... Sure would be great to know what document it is so I could look at it....
I'm not sure on the steps to reproduce other than start an external reindex and get the task status (how I'm seeing this error). If I knew the document I could post it with the mapping.
Provide logs (if relevant):
Successful (200) low level call on GET: /_tasks/hjNNxRi_SQSSihj20T6Vbg%3A4889?pretty=true&error_trace=true&wait_for_completion=false
# Audit trail of this API call:
- [1] HealthyResponse: Node: http://localhost:9200/ Took: 00:00:00.0110890
# Request:
<Request stream not captured or already read to completion by serializer. Set DisableDirectStreaming() on ConnectionSettings to force it to be set on the response.>
# Response:
{
"completed" : true,
"task" : {
"node" : "hjNNxRi_SQSSihj20T6Vbg",
"id" : 4889,
"type" : "transport",
"action" : "indices:data/write/reindex",
"status" : {
"total" : 50501,
"updated" : 0,
"created" : 2000,
"deleted" : 0,
"batches" : 2,
"version_conflicts" : 0,
"noops" : 0,
"retries" : {
"bulk" : 0,
"search" : 0
},
"throttled_millis" : 0,
"requests_per_second" : -1.0,
"throttled_until_millis" : 0
},
"description" : "reindex from [host=docker.for.mac.localhost port=9210 pathPrefix=/ query={\n \"match_all\" : {\n \"boost\" : 1.0\n }\n}][test-search-v44] to [local-test-search-v44][_doc]",
"start_time_in_millis" : 1571866634051,
"running_time_in_nanos" : 353592096200,
"cancellable" : true,
"headers" : { }
},
"error" : {
"type" : "exception",
"reason" : "Error parsing the response, remote is likely not an Elasticsearch instance",
"caused_by" : {
"type" : "x_content_parse_exception",
"reason" : "[1:6032145] [search_response] failed to parse field [hits]",
"caused_by" : {
"type" : "x_content_parse_exception",
"reason" : "[1:6032145] [hits] failed to parse field [hits]",
"caused_by" : {
"type" : "x_content_parse_exception",
"reason" : "[1:6032145] [hit] failed to parse field [_source]",
"caused_by" : {
"type" : "parsing_exception",
"reason" : "[hit] failed to parse [_source]",
"line" : 1,
"col" : 6032145,
"caused_by" : {
"type" : "json_parse_exception",
"reason" : "Duplicate field 'comments'\n at [Source: org.apache.http.nio.entity.ContentInputStream@1ea0ba31; line: 1, column: 6032160]",
"suppressed" : [
{
"type" : "illegal_state_exception",
"reason" : "Failed to close the XContentBuilder",
"caused_by" : {
"type" : "i_o_exception",
"reason" : "Unclosed object or array found"
}
}
]
}
}
}
}
}
}
}