Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

peterbencze / serritor Public

Notifications You must be signed in to change notification settings
Fork 15
Star 31

Code
Issues 2
Pull requests 3
Discussions
Actions
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Wiki
Security
Insights

Releases: peterbencze/serritor

Releases · peterbencze/serritor

Serritor 2.1.1

11 Jun 12:42

peterbencze

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Serritor 2.1.1 Latest

Latest

Fix bug where crawl seeds were fed to the frontier twice, resulting in incorrect crawl stats
Fix bug where crawl stats were not reset when the crawler was restarted after its state was restored
Update dependency versions

Assets 5

Loading

All reactions

Serritor 2.1.0

18 Jun 22:18

peterbencze

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Serritor 2.1.0

This release includes new features, improvements and changes to the existing API.

Changes in a nutshell:

Add helper class for finding text in response content
Refactor UrlFinder
Modify HTTP client so that it uses the same user-defined HTTP proxy as Selenium
Ignore authentication cookie when cookie authentication is not enabled
Use MutableCapabilities instead of DesiredCapabilities when configuring the browser

Assets 5

Loading

All reactions

Serritor 2.0.0

30 May 20:29

peterbencze

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Serritor 2.0.0

This major release includes a number of new features, bug fixes and changes to the existing API.
Changes in a nutshell:

Add internal proxy server to overcome Selenium limitations (no access to response headers etc.)
Add onBrowserInit callback to configure the browser before the crawling begins
Always call onStop even if an unhandled exception is thrown
Rename callbacks
Add detailed logging
Use slf4j instead of builtin logger
Add web API feature
... and more

Assets 5

Loading

All reactions

Serritor 1.6.0

04 Nov 19:48

peterbencze

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Serritor 1.6.0

This release adds the possibility to specify custom callbacks for crawl events.

Assets 5

Loading

All reactions

Serritor 1.5.0

02 Sep 21:52

peterbencze

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Serritor 1.5.0

This release includes bug fixes and a number of enhancements and new features.
Major changes in a nutshell:

Change the access modifier of the stop method
Add the possibility to download files
Add the possibility to retrieve response content type
Fix browser compatibility check exception when using HtmlUnitDriver
Add default URL finder creation method
Remove Selenium cookie synchronization
Add support for loading config from previously saved state
Add static methods for creating crawl requests with the default config

Assets 5

Loading

All reactions

Serritor 1.4.0

23 Jun 14:13

peterbencze

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Serritor 1.4.0

This release includes a number of bug fixes and improvements.

Assets 5

Loading

All reactions

Serritor 1.3.1

22 Apr 15:51

peterbencze

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Serritor 1.3.1

This release includes a new feature and changes to the existing API.
Changes in a nutshell:

Changes how the crawler is configured:
- Adds CrawlerConfigurationBuilder for building CrawlerConfiguration instances
- The configuration is passed to the crawler's constructor
Adds the possibility to download the file in onNonHtmlResponse callback

Please check the Wiki for more information.

Assets 5

Loading

All reactions

Serritor 1.3.0

16 Mar 23:35

peterbencze

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Serritor 1.3.0

This release includes new features, improvements and changes to the existing API.

New features in a nutshell:

Crawl domains: they specify the domains in which crawling is allowed
Crawl delay mechanisms: these can be used to determine the delay between each request
Url finder: it can be used to find URLs in HTML sources using regular expressions

Please check the Wiki for more information.

Assets 5

Loading

All reactions

Serritor 1.2.1

10 Feb 20:26

peterbencze

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Serritor 1.2.1

This release includes minor fixes and improvements (including changes to the API, please check the Wiki for more information).

Assets 5

Loading

All reactions

Serritor 1.2

18 Jul 21:29

peterbencze

Compare

Choose a tag to compare

Loading

Serritor 1.2

This release includes new features, bug fixes and major API modifications. Please check the documentation for more information.

Assets 5

Loading

All reactions

Previous 1 2 Next

Previous Next

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.