Merge pull request #18 from progress/kiranb-main
Initial hdp official helm chart project
kiranbabuneela authored May 28, 2024
2 parents f5873d6 + 647d051 commit d31fa7f
Showing 16 changed files with 879 additions and 0 deletions.
31 changes: 31 additions & 0 deletions charts/hybriddatapipeline/Chart.yaml
@@ -0,0 +1,31 @@
annotations:
category: DataConnectivity

apiVersion: v2

name: hybriddatapipeline

description: Progress® DataDirect® Hybrid Data Pipeline is a light-weight software service that provides simple, secure access to cloud and on-premises data for business intelligence tools and applications. Client applications can use ODBC, JDBC, or OData to access data from a broad range of data stores. Requests from client applications are translated into the format supported by the underlying data store – SQL, NoSQL, Big Data, cloud – and returned in the format accepted by the client. Communications in HTTP and HTTPS are supported. In addition to supporting connectivity to a wide range of relational and non-relational database management systems, Hybrid Data Pipeline supports connectivity to REST sources with a built-in Autonomous REST Connector. Moreover, third-party connectors can be integrated with Hybrid Data Pipeline, expanding the range of data stores to which the service can connect.

type: application

appVersion: "4.6.1"

icon: data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAOEAAADhCAMAAAAJbSJIAAAAmVBMVEX///9c4wD///1P4gD2/u5W4gCe7XpR4wD//f/G87L9///+//xe4wH///tZ5QBb5QBB4QDC9abT+MCs8Yn1/en4/PeI6ktg4QPf+MxQ4ADr/OLj+dTZ+MOM61po5gCY7Wy68phy5i3N9att5yOW7Was8oF/6UKi63G88Zx+5yed73Ww8Y/P9LR76jyY7mme62zq+tmv74mL6E/dky40AAAJZ0lEQVR4nO2da3faOBCGJQtFjixLkEAcc0tCbt0mzSb9/z9uZwRtARuwTfGFnedTT5ty9EYjzUXSwBhBEARBEARBEARBEARBEARBEARBEARBEARBEARB/A8JdBDp1Z+1Dhody2nQDGVNZ/P53YSxKGp6PH+fAPSNfwrFpUr/7TU9mlMQsdFFaq3hxlip7psezl8kBm2awQQOjOIrjBHzKzDc2P9D9wlAI7t9EPw31nGV3oyXptt5IhQxnPWV+aMwAWPlUlyMYBqbHt/xgMD4WiljjFvTiKbqBB80Pbpj0QF6iAVPeT5WPE5ZyEJwlB31Hqhv8iDkDoE4j+lsyFhX9SHDJ2F36uPOOCfFM65FffCj2sk9LEBn9mg0hsNyXLDueY0IDe99rnZrW0c83LLVrtsVAnB0kxeBO2ghiVI8DTu1GgPNRq9CQoS2x0DXAD+i5H23XOO9VNZYmMFCcwi/CNBorpoe9WEiFqJ9suk8dYUmbxPx49N/TIsnExx8qNn4Bg20EuKjx+KwaRl7iJY5knLg6KoItE6pb02L2E/EBlzB7lJw/W1jEuPUvM3LMdL/9MGJV5q/3yT9m08s5bQ074i+REEvv3MewTumrz1f2GlaTZYgOhSJFpHoIFi1kFeFrVQYgHVtpPOlcdyXcjCvanEUt3AYcPPkmKm04maI4V9LS6ujZ8EtP2rHSZxMn8FU40i3M14d/qzs9lcKcRrNAlZj1ML1GOs4YNPHY5bj0qdKBXlVCwV6sELKpXSmaAaVj0xnPXS0TcvJQWMK9V1w547acOCXpN6CVhZWwzgG+xq/HLkcEWHem1aTRxT6aWRX8+MiAIMuUvyYNK1nDwFkw5hrbFeESyHFxyjEA7oWWisMafSvL5nKY/yjUfIeP62NyaOOAjb5AaZ63I7DnXi8amXVEQ0LK4uJOsZKV5XVl8+m1UCIpXfUWaI3If2Z0zHTKMVr3Zo20Zi47rQjyKuO0oeRnOnXqSdLuNSXuxtEsT+lOcpz2IQ3rBCygM/B/WXuPQTvHNmC7z6IOgzYuKhZ0hoBSug9CaFS9ca82kzW409LwXM4U3lBNqkwYuEb+j0Yez95z93XA7w0NJ6lsvqO06BCiM+cWFoSuAXh62XbPwERNP7l9HHXoXCrFUKMnTjufR6EaE6J1+xyRIUhzu2AV63INaBQ6yiGPOkuc5Yt5TdM7HTuJTbIq+wqqm69QgbpPPsmM3MCu4lIpv7KTNZaYQ8a3whry2fHDSiMcF318yIy4yxeCwqDTIyzdJvTRJTfcZqYQzxqyrE250NJKZ5H2f8SwSxCPA55Ven0uEaFAYPlF2PdcH9WpMwAI1UMv7Pxau+7kOWinDoV+tW14Co5NA3iYYo3a3L9I572lzqnqlEhnircPmKlaf9ZEyT3SsyGeC84W9kNdMgWRpXYUWtUqNlwBjuFTNz+vM84WI5KXeelVJpFWCf8UsWD1ToUBmCf6AKu99zmyqLcYnkomPeRvZnAQK6ItdahMPbX1S5hzKW8tRQPk6XzzOP2IbVtUYi3EbDu4kzJuoux6VMvfw7RVAdOFdlVa1AIEzj0IVrJ6qA1lkt1nzuHARZz4kJmX4vCL/CA1apKsOXMfV6VE6pCPjmeHd5Ua7HS97nglW8jKPEy8ZtVViILe4cLOXUohF//W4ntfR1jE2OkeB3l5McaHCwoPLS467BSCNZY7ymtohFN23GrZM61oAC2WVB4yDZq8viw993+ELzywaAT8+nysuVmnNMOK0WCGKKtSwPbu61UqsecA/IqjbfYtxS2Zw5hHQXXqap2uwuvW/rlGLDtOWyJQl9QAscPkWmlLcf4Wo7KPLdokcIVGrMLhxfZqk2leLxtuUL0HAsrbcUAgDspW65Q+xvd12nVAMBI2wGFvmzW55VOXCC0bbnC3/jbQRUch+RdUQjJj1QVLLUzCoMwWFWxz1Vh5BPb8UvpM97OKPzFFeRVvExxo3MKmb5XpQ6VOqcQIoDeR5mLbJ1T6OPoSYlr3t1TyHzyvjDKmIQXCOU6qRB31ei5YLGqkwqXKd9wlhZZjp1UGAVBFEb+1Oaw2+ikwj8MYDkeGHC3FWoWXRw4Pe22Qjz2ZeOf+wOATivEOj0sx7u9hZyOK2S4sV6QwnVIISmsh42nZmepcONe0FkqDNcfmp2lwqv197tnpVCHWOPvzfqXa395Xgp1xOLr1Kbnq9Dn9tyos1W4vEVk3DkqjLHBU++jvxwCKDyznSaALcbf9l1etThDhViWuZpL51ZNoc5PYcA+X/BMJlk92j4fhb75KGyg2NRrfQhnpFD7Z3h43X5jVOejkIU6YFePqd0a0/koBH3jG+HP0jZKv2eiED1g/Jx7XeiUCp01NSnEFHBg828tn1KhcUbVo9DfPN/R0uOkCrlM69HXm+FQ8gdzUivllm+P5QTo0ZeQu6+WnnankRc1CAwe+/uulZxUoZQ5L+FOgG/d2YRC4eroGoGtt3p36c5pPJ3COrvxL6PtZe+4YxQaU+xuIj6JcuKpzm9UwGYlxr8czYyolELuCiq06UOtXYaWjfDuU8mzPrGUwoJWasXq5VudGjFuG36I7DF2KYUqKaLQNzXPfUp0eiYP+IatUvbkF7C83vi4rEKDEzgbh3nPiOpA+4fAtlJugf/Lqk33llGYJA67mYfN9WvDR3XP6cZZfWErNbAKt3aPbYXOKjnwjRmaai2oQ3zWg00gK1ip7D9HbNP6thXKFF/zR2HzbQWX17yNLKrQtyL4Oc58zi+FWNVyxuHLodZ0LxtYufr1F1HoeP9hmtPd5ZdCWKOO+9dfbenPhhn/95WpFlHo3wPlfBXCL4WwCfkXfGHQmu6s2OZqfOPfJB5W6N906dXl8A3+rMP+XQ89YKta7MKecQXL0ajFboXYmseIl+wCXOHfHyZt6DyXD0zJNyXE/rMnMd/TqdPPofPNPNs0eSsC39xjdNe/XLursK3QJ0C7vTfOoUq/cHdpYdNZjb2DYVyTqd5lpRJbDuxbXD0h+089306qLTvMIVYK3fJh7Mvtgei51291J9Y8VgqNhfiZv2Pn6v0KF3UN7K/xS6GT6VfgH9futb4WLr5DrBRKbMET+i/qPPDjccdEgp4LafC5bxu/5+BvEMTsQkl7yZrJz2sA4pwLbGfWXH53asC3D8b4xbFBvVUkgiAIgiAIgiAIgiAIgiAIgiAIgiAIgiAIgiAIgiAa4z93MIewqptIzgAAAABJRU5ErkJggg==

sources:
- https://github.com/progress/hdp-kubernetes-preview
- https://docs.progress.com/bundle/datadirect-hybrid-data-pipeline-46

dependencies:
- name: postgresql
repository: https://charts.bitnami.com/bitnami
version: "13.2.30"
condition: postgres.enabled
- name: kubernetes-ingress
alias: kubernetesingress
repository: https://haproxytech.github.io/helm-charts/
version: "1.38.2"
condition: haproxy.kubernetesIngress.enabled

version: 0.1.0
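
Both dependencies are conditional: a sub-chart is installed only when its condition flag evaluates to true. A minimal sketch of the matching toggles in values.yaml (key paths are taken from the condition fields above; the defaults shown are assumptions):
```
# values.yaml toggles for the conditional dependencies declared in Chart.yaml
postgres:
  enabled: true          # install the bundled Bitnami PostgreSQL sub-chart

haproxy:
  kubernetesIngress:
    enabled: true        # install the HAProxy kubernetes-ingress sub-chart
```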
92 changes: 92 additions & 0 deletions charts/hybriddatapipeline/ReadMe.txt
@@ -0,0 +1,92 @@
# Hybrid Data Pipeline Kubernetes Helm Chart

This repository contains a Helm chart for deploying Hybrid Data Pipeline on a Kubernetes cluster. The steps below describe how to create a Hybrid Data Pipeline StatefulSet for development and testing.

## Getting Started

### Prerequisites

[Helm](https://helm.sh/docs/intro/install/), [kubectl](https://kubernetes.io/docs/tasks/tools/), and the [Azure CLI](https://learn.microsoft.com/en-us/cli/azure/install-azure-cli) must be installed locally to use this chart.

In this release, the Helm chart is supported on Azure Kubernetes Service (AKS).

### Kubernetes Version

This Helm chart currently supports Kubernetes 1.27 or later.

### Installing Hybrid Data Pipeline Helm Chart

The example installation below creates a two-node Hybrid Data Pipeline cluster with a "Default" group. A 1 GB persistent volume, 2 vCPUs, and 4 GB of RAM are allocated for each pod.

1. Add Hybrid Data Pipeline Repo to Helm:
```
helm repo add hybriddatapipeline https://github.com/progress/hdp-kubernetes-preview
```
2. Create a Kubernetes namespace:
```
kubectl create namespace hybriddatapipeline
```
3. The Hybrid Data Pipeline Helm Chart relies on two sub-charts, namely PostgreSQL and HAProxy.

To download the chart's dependencies, run the following from the chart's root directory:
```
helm dependency build
```
This retrieves the dependency charts into the chart's charts/ directory.
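
To confirm that both sub-charts were fetched, you can list the chart's dependencies (the output shown is illustrative):
```
helm dependency list
# NAME                VERSION   REPOSITORY                                   STATUS
# postgresql          13.2.30   https://charts.bitnami.com/bitnami           ok
# kubernetes-ingress  1.38.2    https://haproxytech.github.io/helm-charts/   ok
```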

4. Before installing the Helm chart, the secrets for PostgreSQL and Hybrid Data Pipeline must be created. To create them:
```
kubectl create -f secrets/hdp-secrets.yaml
kubectl create -f secrets/postgres-secrets.yaml
```

5. Adjust the settings in the values.yaml file to create a two-node Hybrid Data Pipeline cluster with a minimum resource allocation of 2 vCPUs, 8 GB RAM, and 100 GB storage for Hybrid Data Pipeline Server. For detailed guidance, refer to the Hybrid Data Pipeline Product Requirements Documentation.

Use the latest Hybrid Data Pipeline Server Docker image, as specified in the values.yaml file. To obtain the most recent image available, consult Progress ESD.

If necessary, push the Hybrid Data Pipeline Server Docker image to a container registry and update the image.repository and image.tag values accordingly (see the sketch after the values excerpt below).

If you want the HAProxy load balancer service to front the Hybrid Data Pipeline Server pods:
- Specify an FQDN in hdp.loadbalancer.hostName.
- Set haproxy.kubernetesIngress.enabled to true (the default).

```
## Progress DataDirect Hybrid Data Pipeline Server parameters
hdp:

# Number of HDP nodes
replicaCount: 2

## Progress DataDirect Hybrid Data Pipeline Server image parameters
image:
repository:
tag:
pullPolicy: IfNotPresent

## Progress DataDirect Hybrid Data Pipeline Server Container persistence parameters
persistence:
mountPath: /hdpshare
size: 1Gi
storageClassName: azurefile-csi

## Progress DataDirect Hybrid Data Pipeline Server Container resources parameters
resources:
requests:
memory: "4096Mi"
cpu: "2000m"
limits:
memory: "8096Mi"
cpu: "4000m"

## Progress DataDirect Hybrid Data Pipeline Server License parameters
licenseKey:

## Progress DataDirect Hybrid Data Pipeline Server Container load balancer parameters
loadbalancer:
hostName:
```
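As an illustration of the registry step above, a hypothetical push to an Azure Container Registry (image name, registry, and tag are placeholders):
```
docker tag hdp-server:4.6.1 myregistry.azurecr.io/hdp-server:4.6.1
docker push myregistry.azurecr.io/hdp-server:4.6.1
```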
6. Install the Hybrid Data Pipeline Helm Chart with the above custom settings.
```
helm install my-release hdp-kubernetes-preview/hybriddatapipeline --values values.yaml --namespace=hybriddatapipeline
```
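Individual values can also be overridden on the command line instead of editing values.yaml; the keys mirror the excerpt in step 5, and the registry and FQDN are placeholders:
```
helm install my-release hdp-kubernetes-preview/hybriddatapipeline \
  --namespace hybriddatapipeline \
  --set hdp.image.repository=myregistry.azurecr.io/hdp-server \
  --set hdp.image.tag=4.6.1 \
  --set hdp.loadbalancer.hostName=hdp.example.com
```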
Once the installation is complete and the pods are running, Hybrid Data Pipeline can be accessed at the hostname configured for hdp.loadbalancer.hostName in values.yaml.
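
A few quick checks to confirm the deployment came up (names are illustrative; adjust to your release and namespace):
```
# Watch the HDP pods start
kubectl get pods --namespace hybriddatapipeline --watch

# Confirm the load balancer / ingress received an external address
kubectl get svc,ingress --namespace hybriddatapipeline

# Once DNS for the configured FQDN resolves, the server should answer over HTTPS
curl -k https://hdp.example.com/
```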
189 changes: 189 additions & 0 deletions charts/hybriddatapipeline/config/hdpdeploy.properties
@@ -0,0 +1,189 @@
; This properties file is used to configure a Docker deployment of Hybrid Data
; Pipeline (HDP). Specify values for each property to suit your environment. A
; number of values can be set using environment variables. Environment variables
; must be specified in the Docker run command used to deploy the service. For
; example:
; docker run -dt -p 8080:8080 -p 8443:8443 -e "ACCEPT_EULA=true"
;   -e HDP_USER_PASSWORD=MyHDPUserPassword
;   -e HDP_ADMIN_PASSWORD=MyHDPAdminPassword
;   -v /opt/hdpshare:/hdpshare --hostname hdp1 --name hdp1
;   --network hdp-net %HDP_DOCKER_IMAGE%
; The names of supported environment variables match the names of the properties
; in this file. For more information about using environment variables, refer to
; "Using environment variables for server and system database credentials":
; https://docs.progress.com/bundle/datadirect-hybrid-data-pipeline-installation-46/page/Using-environment-variables-for-server-and-system-database-credentials.html

; HDP server hostname
; -------------------
; The fully qualified hostname of the HDP server. In a Docker deployment, the
; hostname is the name of the Docker container host. If a hostname is not
; supplied, the Docker image will attempt to detect it.
HDP_SERVER={{ .Release.Name }}-{{ .Values.hdp.services.hdpService.name }}

; HDP SSL certificate
; -------------------
; The name of the SSL certificate file used by the HDP server to establish SSL
; connections with ODBC and JDBC client applications. The certificate file must
; be placed in the shared volume. (NOTE: The default location of the shared
; volume for each demo is C:\hdpshare.) The value provided depends on whether
; you are deploying HDP as a cluster or as a standalone node.
; For cluster deployments, the specified file must be the root certificate used
; to sign the certificate for the load balancer server. PEM, DER, and Base64
; encodings are supported.
; For standalone deployments, you have two options.
; (1) Leave blank to use the self-signed certificate included with the
; deployment.
; (2) Specify the SSL certificate in PEM file format. The PEM file must consist
; of a private key, a public key certificate issued by a certificate authority
; (CA), and additional certificates that make up the trust chain.
; For more information about creating a PEM file, refer to "The PEM file":
; https://docs.progress.com/bundle/datadirect-hybrid-data-pipeline-installation-46/page/The-PEM-file.html.
HDP_SERVER_CERT_FILE=

; System database properties
; ==========================
; An external system database is required for this Docker deployment of HDP.
; For supported databases and requirements, refer to "External system
; databases":
; https://docs.progress.com/bundle/datadirect-hybrid-data-pipeline-installation-46/page/External-system-databases.html.
; The following properties are used to configure the external database. Several
; properties are specific to the database vendor.

; Required. The hostname of the system database.
; NOTE: If you are locally deploying the system database as a Docker container,
; the hostname does not resolve correctly if set to 'localhost'. Instead, you
; must use the Docker-specific lookup name 'host.docker.internal'.
HDP_DATABASE_HOSTNAME={{ .Release.Name }}-{{.Values.hdp.database.hostName}}

; Required. The port of the system database.
HDP_DATABASE_PORT={{.Values.hdp.database.port}}

; Required for PostgreSQL and MySQL databases. The name of the system database.
HDP_DATABASE_NAME={{.Values.hdp.database.name}}

; Required. The name of the system database vendor.
; Valid values include: oracle | mssqlserver | postgresql | mysqlcommunity
HDP_DATABASE_VENDOR=postgresql

; Required if you are using a MySQL Community Edition system database. The
; MySQL Connector/J jar file must be placed in the hdpshare volume, and the name
; of the jar file must be specified with this property.
HDP_MYSQL_DRIVER=

; Required if you are using an Oracle system database with the Oracle System
; Identifier (SID). Specify the SID for this property. Otherwise, leave this
; property blank.
HDP_ORACLE_SID=

; Required if you are using an Oracle system database with the service name. For
; this property, provide the database service name that specifies the database
; that is used for the connection. The service name is a string that is the
; global database name. The global database name typically comprises the
; database name and domain name. Otherwise, leave this property blank.
HDP_ORACLE_SERVICE_NAME=

; Required if you are using a SQL Server system database. Provide the name of
; the SQL Server database schema.
HDP_SQLSERVER_SCHEMA=

; Additional connection parameters may be included in the database connection
; URL. For supported parameters, refer to the data store topic associated with
; your system database in the HDP user's guide:
; https://docs.progress.com/bundle/datadirect-hybrid-data-pipeline-46/page/Supported-data-stores.html
; Parameters should be provided as a list of key-value pairs. For a MySQL
; Community database, the key-value pairs should be separated by an ampersand.
; For other databases, the key-value pairs should be separated by a semicolon.
HDP_DATABASE_ADVANCED_OPTIONS={{.Values.hdp.database.advancedOptions}}

; Required but may be set as an environment variable. The name of the database
; user without administrator privileges. Required privileges depend on the type
; of database you are using. For details on required privileges, refer to
; "External system databases":
; https://docs.progress.com/bundle/datadirect-hybrid-data-pipeline-installation-46/page/External-system-databases.html.
HDP_DATABASE_USER={{ include "hybriddatapipeline.secretValue" (list "default" "hdp-secrets" "hdpDbUser") | trim }}

; Required but may be set as an environment variable. The password of the
; database user without administrator privileges.
HDP_DATABASE_USER_PASSWORD={{ include "hybriddatapipeline.secretValue" (list "default" "hdp-secrets" "hdpDbUserPwd") | trim }}

; Required but may be set as an environment variable. The name of the database
; user with administrator privileges. Required privileges depend on the type of
; database you are using. For details on required privileges, refer to "External
; system databases":
; https://docs.progress.com/bundle/datadirect-hybrid-data-pipeline-installation-46/page/External-system-databases.html.
HDP_DATABASE_ADMIN={{ include "hybriddatapipeline.secretValue" (list "default" "hdp-secrets" "hdpDbAdmin") | trim }}

; Required but may be set as an environment variable. The password of the
; database user with administrator privileges.
HDP_DATABASE_ADMIN_PASSWORD={{ include "hybriddatapipeline.secretValue" (list "default" "hdp-secrets" "hdpDbAdminPwd") | trim }}

; On-Premises Connector
; ---------------------
; Specify 'yes' if you plan to deploy the On-Premises Connector.
; Specify 'no' if you are not deploying the On-Premises Connector.
; If 'yes' is specified, the HDP server uses the On-Premises Access Port (40501)
; and the Notification Server Port (11443) to communicate with the On-Premises
; Connector. These ports must be made available across the firewall. The default
; value is 'yes'.
HDP_ENABLE_ONPREMISE_CONNECTOR=yes

; HDP Logon credentials
; =====================
; HDP provides the default d2cadmin and d2cuser accounts. You must specify
; passwords for each of these accounts with the following properties.

; Required but may be set as an environment variable. The password for the
; default administrator account.
HDP_ADMIN_PASSWORD={{ include "hybriddatapipeline.secretValue" (list "default" "hdp-secrets" "hdpAdminPwd") | trim }}

; Required but may be set as an environment variable. The password for the
; default user account.
HDP_USER_PASSWORD={{ include "hybriddatapipeline.secretValue" (list "default" "hdp-secrets" "hdpUserPwd") | trim }}

; Load balancer
; -------------
; Required. Indicates whether or not a load balancer is being used.
; Valid values: none | tunnel | websocket
; * 'none' should be specified for standalone deployments.
; * 'tunnel' should be specified for load balancer routing using tunneling.
; Tunneling is generally used for network load balancing.
; * 'websocket' should be specified for load balancer routing using the
; WebSocket protocol. The WebSocket protocol is generally used for load
; balancing when using a hosted service such as AWS or Azure.
HDP_LOAD_BALANCER_TYPE=websocket

; Load balancer name
; ------------------
; Required if a load balancer is being used. The hostname or IP address of the
; machine where the load balancer is running.
HDP_LOAD_BALANCER_HOST_NAME={{ .Values.hdp.loadbalancer.hostName }}

; The name of the SSL certificate file used by the load balancer to establish SSL
; connections with ODBC and JDBC client applications. The certificate file must be
; placed in the shared volume. This must be the root certificate used in the
; wildcard certificate chain.
; This certificate must be provided in a Base64 encoding, such as a
; PEM or DER file format.
; For more information about creating a PEM file, refer to "The PEM file":
; https://docs.progress.com/bundle/datadirect-hybrid-data-pipeline-installation-46/page/The-PEM-file.html.
HDP_LOAD_BALANCER_CERT_FILE={{ .Values.haproxy.tls.certFileName }}

; License key
; -----------
; Leave blank to deploy an evaluation version of the product.
; Provide a valid license key to deploy a licensed version of the product.
HDP_LICENSE_KEY={{ .Values.hdp.licenseKey }}

; JVM heap size
; =============
; JVM heap size properties may be specified if you need to limit the memory
; allocated to HDP nodes. Specifying heap size may be particularly useful when
; running multiple Docker nodes on a single host machine.

; The initial JVM heap size in megabytes. The default is 4096 megabytes.
HDP_HEAP_INITIAL={{- .Values.hdp.resources.requests.memory | trunc 63 | trimSuffix "Mi" }}

; The maximum JVM heap size in megabytes. This value cannot be less than the
; initial JVM heap size. The default value is the value specified for the
; HDP_HEAP_INITIAL property.
HDP_HEAP_MAX={{- .Values.hdp.resources.limits.memory | trunc 63 | trimSuffix "Mi" }}
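
The properties template above resolves credentials through a hybriddatapipeline.secretValue named template, which is not part of this change. A plausible minimal sketch of such a helper in templates/_helpers.tpl, assuming it uses Helm's lookup function against the (namespace, secret name, key) list it is passed:
```
{{/*
Hypothetical sketch of the secretValue helper referenced in hdpdeploy.properties.
Takes (namespace, secretName, key) as a list and returns the decoded secret value.
*/}}
{{- define "hybriddatapipeline.secretValue" -}}
{{- $namespace := index . 0 -}}
{{- $name := index . 1 -}}
{{- $key := index . 2 -}}
{{- $secret := lookup "v1" "Secret" $namespace $name -}}
{{- if $secret -}}
{{- index $secret.data $key | b64dec -}}
{{- end -}}
{{- end -}}
```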
11 changes: 11 additions & 0 deletions charts/hybriddatapipeline/secrets/hdp-secrets.yaml
@@ -0,0 +1,11 @@
apiVersion: v1
kind: Secret
metadata:
name: hdp-secrets
data:
hdpDbUser: cGd1c2Vy
hdpDbUserPwd: cGd1c3Jwd2QhQCMxMjM=
hdpDbAdmin: cGdhZG1pbg==
hdpDbAdminPwd: cGdhZG1wd2QhQCMxMjM=
hdpAdminPwd: ZDJjYWRtIUAjMTIz
hdpUserPwd: ZDJjdXNyIUAjMTIz
12 changes: 12 additions & 0 deletions charts/hybriddatapipeline/secrets/postgres-secrets.yaml
@@ -0,0 +1,12 @@
apiVersion: v1
kind: Secret
metadata:
name: postgres-secrets
data:
postgres.db.name: cG9zdGdyZXM=
postgres.db.username: cG9zdGdyZXM=
postgres.db.password: ZG9ja2Vy
postgres.hdp.pgadmin.user: cGdhZG1pbg==
postgres.hdp.pgadmin.pwd: cGdhZG1wd2QhQCMxMjM=
postgres.hdp.pguser.user: cGd1c2Vy
postgres.hdp.pguser.pwd: cGd1c3Jwd2QhQCMxMjM=
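
Both secret manifests ship with default demo credentials, base64-encoded as Kubernetes Secret data fields require. Before any real deployment you would typically replace them; for example:
```
# Decode a shipped value to see what it contains
echo 'cGd1c2Vy' | base64 -d          # -> pguser

# Encode a replacement value (-n avoids encoding a trailing newline)
echo -n 'MyNewPassword' | base64
```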
