我正在编写一个简短的Node.js片段,该片段分析RSS提要,提取链接,将它们重新配置为所需的PDF链接,然后写入这些文件。代码如下:
var https = require('https');
var fs = require('fs');
const Parser = require("rss-parser");
let parser = new Parser();
parser.parseURL("https://regulations.justia.com/regulations/fedreg?limit=20&mode=atom")
.then((feed) => {
const base = "https://docs.regulations.justia.com/entries"
feed.items.forEach((item, i) => {
// Parsing to create PDF link...
const str = item.link;
let dates = str.substring(50, 60);
let newDates = dates.replace(/\//, "-").replace(/\//, "-");
let ending = str.substring(61).replace(".html",".pdf");
let fullString = `${base}/${newDates}/${ending}`;
// Fetching and saving the PDF file....
const file = fs.createWriteStream(`${item.title}.pdf`);
const request = https.get(fullString, (res) => {
res.pipe(file);
});
});
})
.catch((err) => console.log(err));
我现在遇到两个错误。
1)与我的可写流有关。当我尝试根据RSS Feed中的item.title
创建文件时,每次都会出现此错误:
Error: ENOENT: no such file or directory, open 'Notice - Solicitation of Nominations for Appointment to the World Trade Center Health Program Scientific/Technical Advisory Committee (STAC).pdf'
这与项目标题中的括号或破折号有关吗?如果没有,还有什么可能导致此问题?
2)当我确实更改代码(将可写流命名为更简单的名称)时,我的代码将引发以下错误:
Error: socket hang up
at TLSSocket.onHangUp (_tls_wrap.js:1135:19)
at Object.onceWrapper (events.js:313:30)
at emitNone (events.js:111:20)
at TLSSocket.emit (events.js:208:7)
at endReadableNT (_stream_readable.js:1056:12)
at _combinedTickCallback (internal/process/next_tick.js:138:11)
at process._tickCallback (internal/process/next_tick.js:180:9)
通常在我下载了许多PDF(但不是全部)后,就会引发此错误。为了克服这些错误,在此示例中我可以做些什么更改?谢谢您的帮助!
答案 0 :(得分:2)
问题是item.title
中的某些字符包含/
字符,该字符指示在这种情况下不存在的文件夹。
当您摆脱标题中的那些/
时,它起作用。例如
const file = fs.createWriteStream(`${item.title.replace('/', '-')}.pdf`);