nutch solrindex添加1个文档

时间:2018-07-06 06:21:31

标签: pdf solr web-crawler nutch

我尝试使用Nuct mysql solrindex添加pdf,但是仅添加了一个文档。

已解析的数据足够大。有什么问题吗?

0/0 spinwaiting/active, 50 pages, 0 errors, 3.3 3 pages/s, 7994 6968 kb/s, 0 URLs in 0 queues
-activeThreads=0
ParserJob: resuming:    false
ParserJob: forced reparse:      false
ParserJob: parsing all
Parsing http://www.adb.org/sites/default/files/cross-debarment-agreement.pdf
......
cilab@cilab:~/workspace-jupyter/#Members/JongYum/nutch/runtime/local$ bin/nutch solrindex http://localhost:8983/solr/demo -allSolrIndexerJob: starting
Adding 1 documents
SolrIndexerJob: done.

0 个答案:

没有答案