Algorithm Container for Jupyter and Zeppelin Notebooks
-
Download the '.ipynb' file from the current repository and import it into Jupyter.
-
Open the downloaded 'AEC notebook'. Select and run the first paragraph of the notebook. An initialisation button (labelled 'Initialise') should appear as an output of the paragraph's execution.
-
Press the initialisation button and wait for the following paragraphs to run. You can monitor the kernel state shown in the upper right part of the notebook to know when the execution is finished.
-
Once all paragraphs have finished running, you should see the main AEC UI, which includes five tabs.
-
From the 'Input File' tab, select the Dataset and then the file that contains the data to be used in the analysis. You may also select the appropriate separator, as AEC only works with delimited files (CSV).
-
From the 'Algorithm Selection & Configuration' tab select the algorithm family and then the specific algorithm you want to apply. Once an algorithm is selected, a configuration form will appear. Configure all algorithm parameters here before proceeding.
-
From the 'Output File' tab, select the Dataset and choose a name for the folder where the analysis results should be saved.
-
From the 'Overview' tab you can have a quick view of the current selections across the previous three tabs. If everything looks ok, press the 'Execute' button and wait for the algorithm to be applied. Once the analysis is finished, results will appear in this tab.
-
From the 'Model Analysis' tab, you can re-run a classification or a regression based on the model just created.
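If you are unsure which separator to pick in the 'Input File' tab, you can inspect a few lines of the file beforehand. The following is a minimal sketch using Python's standard `csv.Sniffer`; it is not part of AEC, and the function name `guess_separator`, the candidate list and the sample data are all illustrative assumptions.

```python
import csv


def guess_separator(sample: str, candidates: str = ",;\t|") -> str:
    """Guess the field separator of a sample of delimited text.

    `candidates` restricts the guess to characters commonly used as
    separators (an assumption; extend it if your files use something else).
    Raises csv.Error if no consistent separator is found.
    """
    dialect = csv.Sniffer().sniff(sample, delimiters=candidates)
    return dialect.delimiter


# Hypothetical sample: in practice, read the first few lines of your own file.
sample = "id;speed;label\n1;42.5;ok\n2;38.0;ok\n3;51.2;alert\n"
print(repr(guess_separator(sample)))
```

The sniffer works on a text sample rather than the whole file, so reading the first few kilobytes is usually enough. Treat the result as a hint and confirm it against the file before running the analysis.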
-
In the AEGIS Data Store 'Project Datasets', locate the dataset to be used, select it and copy its HDFS path (upper right corner).
-
Invoke the Container by choosing 'Analytics' on the right sidebar. If Anaconda is not enabled, enable it and wait until all installation scripts have finished.
-
Create a Zeppelin notebook by
-
Copy the dataset's HDFS URL into the 'Set Dataset path' field.
-
Select the 'Separator'.
-
Select the desired 'Analysis Type' and then the actual 'Algorithm'.
-
Adjust the algorithm's configuration parameters as needed.
-
'Run' the analysis.
-
You can view the analysis results in the predefined Zeppelin visualisation charts. The results are also stored back in the AEGIS Data Store, in your project's folder.
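A copied dataset path can pick up stray whitespace or lose its leading slash, which would only show up once the analysis runs. The sketch below does a quick sanity check on the string before it is pasted into the 'Set Dataset path' field. The exact path format the Container expects is not documented here, so the accepted forms (an `hdfs://` URL or an absolute path) and the example paths are assumptions, and `check_dataset_path` is a hypothetical helper.

```python
from urllib.parse import urlsplit


def check_dataset_path(raw: str) -> str:
    """Return the trimmed path if it looks like an HDFS location.

    Assumption: a valid value is either an 'hdfs://...' URL or an
    absolute path starting with '/'. Anything else raises ValueError.
    """
    path = raw.strip()  # drop whitespace picked up while copying
    if not path:
        raise ValueError("dataset path is empty")
    parts = urlsplit(path)
    if parts.scheme not in ("", "hdfs"):
        raise ValueError(f"unexpected scheme: {parts.scheme!r}")
    if not parts.path.startswith("/"):
        raise ValueError("dataset path must be absolute")
    return path


# Hypothetical example path; use the one copied from the Data Store.
print(check_dataset_path("  /Projects/demo/dataset/data.csv\n"))
```

The check is purely syntactic: it does not contact HDFS, so it cannot tell whether the file exists or is readable.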