Website: https://buildingid.pnnl.gov/
To complete this guide, Git and Python 3 are required. Dependencies are automatically installed using pip.
Clone the repository, and then install the pnnl-buildingid
package:
git clone https://github.com/pnnl/buildingid-py cd buildingid-py pip3 install -e .
Verify the location of the buildingid
command:
which buildingid #=> /usr/local/bin/buildingid
Use pip to remove the pnnl-buildingid
package:
pip3 uninstall pnnl-buildingid
Test the pnnl-buildingid
package using the nose package:
nosetests tests/
Coverage testing is enabled using the coverage package:
nosetests --with-coverage --cover-html --cover-package=buildingid tests/
The pnnl-buildingid
package supports two usages:
- Application programming interface (API)
- Command-line interface (CLI; the
buildingid
command)
buildingid.code
Code
CodeArea
area() -> float
encode() -> Code
intersection(CodeArea) -> typing.Optional[typing.Tuple[float, float, float, float]]
jaccard(CodeArea) -> typing.Optional[float]
resize() -> CodeArea
decode(Code) -> CodeArea
encode(float, float, float, float, float, float, **kwargs) -> Code
isValid(Code) -> bool
Note: The Optional
and Tuple
type hints are provided by the typing module.
In the following example, a UBID code is decoded and then re-encoded:
#!/usr/bin/env python3 # -*- coding: utf-8 -*- import buildingid.code if __name__ == '__main__': # Initialize UBID code. orig_code = '849VQJH6+95J-51-58-42-50' print(orig_code) # Decode UBID code. orig_code_area = buildingid.code.decode(orig_code) print(orig_code_area) # Resize resulting UBID code area. # # The effect of this operation is that the length and width of the UBID code # area are reduced by half an OLC code area. new_code_area = orig_code_area.resize() print(new_code_area) # Encode new UBID code area. new_code = new_code_area.encode() print(new_code) # Test that new UBID code and UBID code area match the originals. assert (orig_code == new_code) assert (orig_code_area == new_code_area)
View the documentation for the buildingid
command using the --help
command-line option:
buildingid --help #=> Usage: buildingid [OPTIONS] COMMAND [ARGS]... #=> <<more lines of output>>
View the documentation for a sub-command of the buildingid
command using the --help
command-line option.
For example, to view the documentation for the "append2csv" sub-command:
buildingid append2csv --help #=> Usage: buildingid append2csv [OPTIONS] [latlng|wkb|wkt] #=> <<more lines of output>>
Command name | Description |
---|---|
append2csv | Read CSV file from stdin, append UBID field, and write CSV file to stdout. |
crossref | Read two CSV files, cross-reference UBID fields, and write CSV file. |
Instructions in this section use Bash syntax.
buildingid
command is installed.- Verify installation:
buildingid --version
- Expected output: "buildingid, version 2.0.0" (or higher version)
- Verify installation:
- Locate input CSV file, e.g.,
path/to/in.csv
. - Locate output CSV file (generated), e.g.,
path/to/out.csv
. - Locate errors CSV file (generated), e.g.,
path/to/err.csv
. - Identify number of digits in Open Location Code (OLC) part of UBID code string, e.g., 11.
- Identify column of output CSV file that contains UBID code strings, e.g., "UBID".
- If input CSV file contains latitude and longitude coordinates for a centroid only:
- Identify columns of input CSV file that contain latitude and longitude coordinates, e.g., "Latitude" and "Longitude".
- Assign UBIDs:
buildingid append2csv latlng --code-length=11 --fieldname-code="UBID" --fieldname-center-latitude="Latitude" --fieldname-center-longitude="Longitude" < path/to/in.csv > path/to/out.csv 2> path/to/err.csv
- If input CSV file contains latitude and longitude coordinates for (i) a centroid and (ii) the northeast and southwest corners of a bounding box:
- Identify columns of input CSV file that contain latitude and longitude coordinates, e.g., "Latitude_C", "Longitude_C", "Latitude_N", "Longitude_E", "Latitude_S", and "Longitude_W".
- Assign UBIDs:
buildingid append2csv latlng --code-length=11 --fieldname-code="UBID" --fieldname-center-latitude="Latitude_C" --fieldname-center-longitude="Longitude_C" --fieldname-north-latitude="Latitude_N" --fieldname-east-longitude="Longitude_E" --fieldname-south-latitude="Latitude_S" --fieldname-west-longitude="Longitude_W" < path/to/in.csv > path/to/out.csv 2> path/to/err.csv
- If input CSV file contains hex-encoded well-known binary (WKB) strings:
- Identify column of input CSV file that contains hex-encoded WKB strings, e.g., "WKB".
- Assign UBIDs:
buildingid append2csv wkb --code-length=11 --fieldname-code="UBID" --fieldname-wkbstr="WKB" < path/to/in.csv > path/to/out.csv 2> path/to/err.csv
- If input CSV file contains well-known text (WKT) strings:
- Identify column of input CSV file that contains WKT strings, e.g., "WKT".
- Assign UBIDs:
buildingid append2csv wkt --code-length=11 --fieldname-code="UBID" --fieldname-wktstr="WKT" < path/to/in.csv > path/to/out.csv 2> path/to/err.csv
See buildingid append2csv --help
for full help.
buildingid
command is installed.- Verify installation:
buildingid --version
- Expected output: "buildingid, version 2.0.0" (or higher version)
- Verify installation:
- Locate left input CSV file, e.g.,
path/to/left.csv
. - Locate right input CSV file, e.g.,
path/to/right.csv
. - Locate output CSV file (generated), e.g.,
path/to/out.csv
. - Identify column of left input CSV file that contains UBID code strings, e.g., "UBID".
- Identify column of right input CSV file that contains UBID code strings, e.g., "UBID".
- Cross-reference UBIDs:
buildingid crossref path/to/left.csv path/to/right.csv path/to/out.csv --left-fieldname-code="UBID" --right-fieldname-code="UBID"
See buildingid crossref --help
for full help.
Default behavior is for output CSV file to be many-to-many (i.e., many records in left input CSV file are cross-referenced with many records in right input CSV file).
Use --left-group-by-jaccard
and --right-group-by-jaccard
options for one-to-many and many-to-one, respectively.
Default behavior is for output CSV file to include only columns that contain UBID code strings.
Use --include-left-field
and --include-right-field
options to include other columns.
- Geospatial Data Abstraction Library (GDAL) is installed.
- Verify installation:
ogr2ogr --version
- Expected output: "GDAL 2.3.1, released 2018/06/22" (version and release date may vary)
- Verify installation:
- Locate input Esri shapefile, e.g.,
path/to/in.shp
. - Locate input PRJ file, e.g.,
path/to/in.prj
. - Locate output CSV file (generated), e.g.,
path/to/out.csv
. - Convert input Esri shapefile into output CSV file:
ogr2ogr -s_srs "$(cat path/to/in.prj)" -t_srs "EPSG:4326" -f CSV path/to/out.csv path/to/in.shp -lco GEOMETRY=AS_WKT
See ogr2ogr --long-usage
for full help.
Output CSV file has added "WKT" column whose elements are well-known text (WKT) strings; enabled by -lco GEOMETRY=AS_WKT
option.
Projection system for geographic coordinates in output CSV file is WGS84; enabled by -t_srs "EPSG:4326"
option.
Records in input Esri shapefile are converted into rows in output CSV file, where fields in input Esri shapefile are converted into columns in output CSV file.
Shapes in input Esri shapefile are converted into elements of "WKT" column of output CSV file.
The City of Chicago's open data portal hosts the "Building Footprints (current)" dataset in CSV format; available at: https://data.cityofchicago.org/api/views/syp8-uezg/rows.csv?accessType=DOWNLOAD.
The "the_geom" column of the input CSV file contains WKT strings.
To assign UBIDs to the records in the input CSV file:
buildingid append2csv wkt --code-length=11 --fieldname-code="UBID" --fieldname-wktstr="the_geom" < rows.csv > rows.out.csv 2> rows.err.csv
The City of San Jose hosts datasets that include building footprints and land parcels.
The contents of the "Basemap_2" zip archive includes a building footprints dataset in Esri shapefile format. The coordinate system is NAD 1983 StatePlane California III FIPS 0403 Feet.
To convert the Esri shapefile into CSV format and then assign UBIDs to the resulting CSV file:
ogr2ogr -s_srs "$(cat Basemap2_201905021152225992/BuildingFootprint.prj)" -t_srs "EPSG:4326" -f CSV BuildingFootprint.csv Basemap2_201905021152225992/BuildingFootprint.shp -lco GEOMETRY=AS_WKT
buildingid append2csv wkt --code-length=11 --fieldname-code="UBID" --fieldname-wktstr="WKT" < BuildingFootprint.csv > BuildingFootprint.out.csv 2> BuildingFootprint.err.csv
Contributions are accepted on GitHub via the fork and pull request workflow. See here for more information.