Question

我正在努力学习solr，但有些技术问题令我感到困惑。我有一个基本结构的大文档：

url -> {Some giant json object}
url -> {another giant json object}
...
url -> {another giant json object}

并且有将近30,000个。我想将它们索引为solr。所以我创建了一个schema.xml，它包含所有可能的字段，以及是否已编制索引，多值等等。

我想知道下一步该做什么的一般结构是什么。我知道我必须索引文件，但是我是否分别对每一行使用curl命令？只是寻找更高层次的理解，因为在线资源对我来说有点混乱。

谢谢！

EDIT-- 终端命令是索引这些特定类型文件的最快方法吗？我更新了示例json文件的样子。

Answer 1

卷曲请求

curl 'http://localhost:8983/solr/collection1/update/json/docs'
'?split=/exams'
'&f=first:/first'
'&f=last:/last'
'&f=grade:/grade'
'&f=subject:/exams/subject'
'&f=test:/exams/test'
'&f=marks:/exams/marks'
 -H 'Content-type:application/json' -d '

这会将数据索引为

{
  "first": "John",
  "last": "Doe",
  "grade": 8,
  "exams": [
      {
        "subject": "Maths",
        "test"   : "term1",
        "marks":90},
        {
         "subject": "Biology",
         "test"   : "term1",
         "marks":86}
      ]
}'

要了解更多信息，请点击此链接 -

https://lucidworks.com/blog/2014/08/12/indexing-custom-json-data/

Solr索引多个json对象

1 个答案: