Skip to content

Latest commit

 

History

History
116 lines (68 loc) · 3.26 KB

README.md

File metadata and controls

116 lines (68 loc) · 3.26 KB

dbf-parser

This is an event-based dBase file parser for very efficiently reading data from *.dbf files.

Based on https://github.com/abstractvector/node-dbf but works without coffee and provide float type of numbers. F.e. correct work with VMap0 geonames base (http://gis-lab.info/data/vmap0-settl-rus/population-gislab.7z)

To get started, simply install the module using npm:

npm install dbf-parser

and then require it:

var Parser = require('dbf-parser');

#Classes

There are two classes - the Parser and the Header. The Parser is the most interesting class.

##Parser

This class is the main interface for reading data from dBase files. It extends EventEmitter and its output is via events.

###new Parser(path)

  • path String The full path to the .dbf file to parse

Creates a new Parser and attaches it to the specified filename.

var Parser = require('node-dbf');

var parser = new Parser('/path/to/my/dbase/file.dbf');

###parser.on(event, listener)

  • event String The event name to listen for (see below for details)
  • listener Function The callback to bind to the event

This method is inherited from the EventEmitter class.

###parser.parse()

Call this method once you have bound to the events you are interested in. Although it returns the parser object (for chaining), all the dBase data is outputted via events.

parser.parse();

###Event: 'start'

  • parser Parser The parser object

This event is emitted as soon as the parser.parse() method has been invoked.

###Event: 'header'

  • header Header The header object as parsed from the dBase file

This event is emitted once the header has been parsed from the dBase file

###Event: 'record'

  • record Object An object representing the record that has been found

The record object will have a key for each field within the record, named after the field. It is trimmed (leading and trailing) of any blank characters (dBase files use \x20 for padding).

In addition to the fields, the object contains two special keys:

  • @sequenceNumber Number indicates the order in which it was extracted
  • @deleted Boolean whether this record has been deleted or not

This object may look like:

{
    "@sequenceNumber": 123,
    "@deleted": false,
    "firstName": "John",
    "lastName": "Smith
}

###Event: 'end'

  • parser Parser The parser object

This event is fired once the dBase parsing is complete and there are no more records remaining.

##Usage

The following code example illustrates a very simple usage for this module:

var Parser = require('node-dbf');

var parser = new Parser('/path/to/my/dbase/file.dbf');

parser.on('start', function(p) {
    console.log('dBase file parsing has started');
});

parser.on('header', function(h) {
    console.log('dBase file header has been parsed');
});

parser.on('record', function(record) {
    console.log('Name: ' + record.firstName + ' ' + record.lastName); # Name: John Smith
});

parser.on('end', function(p) {
    console.log('Finished parsing the dBase file');
});

parser.parse();

#TODO

  • Write some tests
  • Add support for field types other than Character and Numeric
  • Use fs.readStream instead of fs.readFile for increased performance