Follow the instructions here to install the Dremio docker container. Make sure to set up the user name and password in the container as follows:
username="admin"
password="admin01!"
In the top right, click the + button, and add the CSV in utilities/data from this repo.
Finally, to ensure the data types are all correct, run the below SQL:
SELECT TO_DATE(AsofDate, 'YYYY-MM-DD', 1) AS AsofDate,
CUSIP,
SectorCode,
Strategy,
CAST(MarketValue as DOUBLE) as MarketValue,
CAST(EffectiveDuration as DOUBLE) as EffectiveDuration,
CAST(OptionAdjustedSpread as DOUBLE) as OptionAdjustedSpread, LegalEntity, PortfolioTicker, OwnershipType,
CAST(PercentWeight as DOUBLE) as PercentWeight
FROM "@admin".SampleData20240104 AS SampleData20240104;
Run the CreateEnvironment.sh
script to install the experimental version of the python API that supports a generic JDBC connector. This uses a Dremio Dialect JAR that is in utilities/extra_jars
.
Finally, you will need to download the Dremio JDBC driver here.
After this, you should be able to run StartDemo.sh
to start the environment in Jupyter Lab. There are two notebooks, a DQ version and an in-memory version, pulling from the same table.