Releases: apify/crawlee
v3.12.0
3.12.0 (2024-11-04)
Bug Fixes
- `.trim()` urls from pretty-printed sitemap.xml files (#2709) (802a6fe), closes #2698
- ensure correct column order in CSV export (#2734) (b66784f), closes #2718
- ignore errors from iframe content extraction (#2714) (627e5c2), closes #2708
- update `fingerprintGeneratorOptions` types (#2705) (fcb098d), closes #2703
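
The sitemap fix above concerns pretty-printed files, where the URL inside a `<loc>` element is surrounded by newlines and indentation. A minimal sketch of the idea, assuming a simple regex-based extraction; `extractSitemapUrls` is a hypothetical helper written for illustration and is not Crawlee's actual sitemap parser:

```typescript
// Hypothetical helper (not part of Crawlee): collect <loc> values from a
// sitemap string and trim the whitespace that pretty-printing introduces.
function extractSitemapUrls(xml: string): string[] {
    const urls: string[] = [];
    const locPattern = /<loc>([\s\S]*?)<\/loc>/g;
    let match: RegExpExecArray | null;
    while ((match = locPattern.exec(xml)) !== null) {
        // Without .trim(), the URL would keep its leading newline and indentation.
        const url = match[1].trim();
        if (url.length > 0) {
            urls.push(url);
        }
    }
    return urls;
}

const prettyPrinted = `
<urlset>
  <url>
    <loc>
      https://example.com/a
    </loc>
  </url>
  <url><loc>https://example.com/b</loc></url>
</urlset>`;

console.log(extractSitemapUrls(prettyPrinted));
```

Both the indented and the compact `<loc>` entries come out as clean URLs, which is the behavior the changelog entry describes.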
Features
v3.11.5
3.11.5 (2024-10-04)
Bug Fixes
- core: fix `forefront` request fetching in RQv2 (#2689) (03951bd), closes #2669
- core: respect `forefront` option in `prolong-` and `deleteRequestLock` (#2690) (cba8da3), closes #2681 #2689 #2669
- core: check `.isFinished()` before `RequestList` reads (#2695) (6fa170f)
- core: accept `UInt8Array` in `KVS.setValue()` (#2682) (8ef0e60)
- core: trigger `errorHandler` for session errors (#2683) (7d72bcb), closes #2678
- core: decode special characters in proxy `username` and `password` (#2696) (0f0fcc5)
- http-crawler: avoid crashing when gotOptions.cache is on (#2686) (1106d3a)
- puppeteer: rename `ignoreHTTPSErrors` to `acceptInsecureCerts` to support v23 (#2684) (f3927e6)
- memory-storage: respect `forefront` option in `RequestQueue` (#2681) (b0527f9), closes #2669
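
The proxy credentials fix in the list above deals with special characters such as `@` or `:`, which must be percent-encoded inside a proxy URL and decoded again before use. A minimal sketch of that decoding step using the standard `URL` class; `parseProxyCredentials` and the example proxy URL are assumptions made for illustration, not Crawlee's internal code:

```typescript
interface ProxyCredentials {
    username: string;
    password: string;
}

// Hypothetical helper (not part of Crawlee): read the credentials from a
// proxy URL and decode percent-encoded special characters.
function parseProxyCredentials(proxyUrl: string): ProxyCredentials {
    const parsed = new URL(proxyUrl);
    // URL keeps username and password in their percent-encoded form,
    // so they have to be decoded explicitly.
    return {
        username: decodeURIComponent(parsed.username),
        password: decodeURIComponent(parsed.password),
    };
}

// "%40" encodes "@" and "%3A" encodes ":".
const credentials = parseProxyCredentials(
    'http://user%40example.com:p%40ss%3Aword@proxy.example.com:8000',
);
console.log(credentials.username); // user@example.com
console.log(credentials.password); // p@ss:word
```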
v3.11.4
v3.11.3
v3.11.2
3.11.2 (2024-08-28)
Bug Fixes
- RequestQueueV2: remove `inProgress` cache, rely solely on locked states (#2601) (57fcb08)
- use namespace imports for cheerio to be compatible with v1 (#2641) (f48296f)
- Use the correct mutex in memory storage RequestQueueClient (#2623) (2fa8a29)
Features
- `globs` & `regexps` for `SitemapRequestList` (#2631) (b5fd3a9)
- resilient sitemap loading (#2619) (1dd7660)
This release pins the cheerio dependency to the last RC version. We might postpone official support for v1 to the next major, or at least wait for them to fix their stuff. Nice demonstration of how not to maintain popular open source projects 😞
v3.11.1
v3.11.0
v3.10.5
3.10.5 (2024-06-12)
Bug Fixes
- allow creating new adaptive crawler instance without any parameters (9b7f595)
- declare missing peer dependencies in `@crawlee/browser` package (#2532) (3357c7f)
- fix detection of HTTP site when using the `useState` in adaptive crawler (#2530) (7e195c1)
- mark `context.request.loadedUrl` and `id` as required inside the request handler (#2531) (2b54660)
v3.10.4
3.10.4 (2024-06-11)
Bug Fixes
- add `waitForAllRequestsToBeAdded` option to `enqueueLinks` helper (925546b), closes #2318
- add missing `useState` implementation into crawling context (eec4a71)
- make `crawler.log` publicly accessible (#2526) (3e9e665)
- playwright: allow passing new context options in `launchOptions` on type level (0519d40), closes #1849
- respect `crawler.log` when creating child logger for `Statistics` (0a0d75d), closes #2412
v3.10.3
3.10.3 (2024-06-07)
Bug Fixes
- adaptive-crawler: log only once for the committed request handler execution (#2524) (533bd3f)
- increase timeout for retiring inactive browsers (#2523) (195f176)
- respect implicit router when no `requestHandler` is provided in `AdaptiveCrawler` (#2518) (31083aa)
- revert the scaling steps back to 5% (5bf32f8)