GitHub - pauljohncleary/url-info-scraper: Utility for retrieving a small amount of meta data from a URL

Library to retrieve metadata (title, favicon address, etc.) from a URL.

Install

$ npm install --save url-info-scraper

Usage

var urlInfoScraper = require('url-info-scraper');

urlInfoScraper('http://en.wikipedia.org/wiki/Wikipedia', function(error, linkInfo) {
  if (error) {
    return console.error(error); // linkInfo may be missing on failure
  }
  var title = linkInfo.title; // 'Wikipedia - Wikipedia, the free encyclopedia'
});
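
The callback takes `(error, linkInfo)`, which matches the Node.js error-first convention, so it can be wrapped in a Promise if that suits the calling code better. The following is a minimal sketch, not part of the library: `scrapeAsPromise` is a hypothetical helper, and it assumes `error` is falsy on success.

```javascript
// Hypothetical helper (not provided by url-info-scraper).
// Wraps a scraper with the signature scraper(url, callback(error, linkInfo))
// in a Promise. Assumes `error` is falsy when the request succeeds.
function scrapeAsPromise(scraper, url) {
  return new Promise(function(resolve, reject) {
    scraper(url, function(error, linkInfo) {
      if (error) {
        reject(error);
        return;
      }
      resolve(linkInfo);
    });
  });
}
```

With the `urlInfoScraper` function from the usage example, this could be called as `scrapeAsPromise(urlInfoScraper, 'http://en.wikipedia.org/wiki/Wikipedia').then(function(linkInfo) { ... })`. Passing the scraper in as an argument also makes it easy to substitute a stub in tests.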

The response is an object with the following properties:

{
  isWebResource: boolean, // true if the link is valid
  title: string, // title of the requested page
  mime: string, // Content-Type header of the page, e.g. image/jpeg
  parsable: boolean, // false if the content type is an 'application' type
  tooLarge: boolean, // true if the link body is larger than 5MB
  faviconUrl: string // URL of the favicon for the root site, null if not found
}
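
As an illustration of how these properties might be used together, here is a small sketch that turns a response object into a one-line summary. `describeLinkInfo` is a hypothetical helper, not part of the library, and the order in which the flags are checked is an assumption made for this example.

```javascript
// Hypothetical helper (not provided by url-info-scraper).
// Reads only the properties documented above; the check order
// (validity, then size, then parsability) is an assumption.
function describeLinkInfo(linkInfo) {
  if (!linkInfo || !linkInfo.isWebResource) {
    return 'not a valid web resource';
  }
  if (linkInfo.tooLarge) {
    return 'body larger than 5MB (' + linkInfo.mime + ')';
  }
  if (!linkInfo.parsable) {
    return 'not parsable (' + linkInfo.mime + ')';
  }
  // faviconUrl is documented as null when no favicon was found
  var favicon = linkInfo.faviconUrl || 'no favicon';
  return linkInfo.title + ' [' + linkInfo.mime + ', ' + favicon + ']';
}
```

Calling `describeLinkInfo(linkInfo)` inside the scraper callback would give a short string suitable for logging.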

Todo

Rewrite tests to use mocked resources instead of real urls
"Best image" support
Store additional metadata (response time etc.)
Screenshots
...?

