crawler17.3kMIT2.0.2Crawler is a ready-to-use web spider that works with proxies, asynchrony, rate limit, configurable request pools, jQuery, and HTTP/2 support.
bda-researchabout 2 months agocrawler, javascript, spider, scraper @nodelib/fs.walk264.3mMIT3.0.1A library for efficiently walking a directory recursively
nodelib8 months agocrawler, NodeLib, fs, FileSystem fdir93mMIT6.4.6The fastest directory crawler & globbing alternative to glob, fast-glob, & tiny-glob. Crawls 1m files in < 1s
thecodrrabout 1 month agocrawler, util, os, sys isbot4.9mUnlicense5.1.28🤖/👨🦰 Recognise bots/crawlers/spiders using the user agent string.
omrilotan2 months agocrawlers, bot, spiders, googlebot pdf-parse4.2mMIT1.1.1Pure javascript cross-platform module to extract text from PDFs.
autokentabout 3 years agopdf-crawler, pdf-parse, xpdf, pdf.js