The SQL plugin allows you to write your own SQL queries and use them in the Pipeline stack.
SQL, Structured Query Language, is a language for manipulating databases.
composer require php-etl/sql-plugin
The SQL plugin uses the PDO extension and relies on its interface to access databases using the dsn, username and password parameters.
The connection must always be defined, whether you are configuring an extractor, a loader or a lookup.
connection:
  dsn: 'mysql:host=127.0.0.1;port=3306;dbname=kiboko'
  username: username
  password: password
You can pass connection options using options. Currently, the only supported option controls whether the database connection is persistent.
connection:
  # ...
  options:
    persistent: true
sql:
  extractor:
    query: 'SELECT * FROM table1'
  connection:
    dsn: 'mysql:host=127.0.0.1;port=3306;dbname=kiboko'
    username: username
    password: password
sql:
  lookup:
    query: 'SELECT * FROM table2 WHERE bar = foo'
    merge:
      map:
        - field: '[options]'
          expression: 'lookup["name"]'
  connection:
    dsn: 'mysql:host=127.0.0.1;port=3306;dbname=kiboko'
    username: username
    password: password
sql:
  loader:
    query: 'INSERT INTO table1 VALUES (bar, foo, barfoo)'
  connection:
    dsn: 'mysql:host=127.0.0.1;port=3306;dbname=kiboko'
    username: username
    password: password
The SQL plugin lets you write queries with parameters.
If you write a prepared statement using named parameters (:param), the parameter key in the configuration is the parameter name without the leading colon:
sql:
  loader:
    query: 'INSERT INTO table1 VALUES (:value1, :value2, :value3)'
    parameters:
      - key: value1
        value: '@=input["value1"]'
      - key: value2
        value: '@=input["value2"]'
      - key: value3
        value: '@=input["value3"]'
  # ...
If you write a prepared statement using question mark placeholders (?), the parameter key in the configuration is the position of the placeholder, starting from 1:
sql:
  loader:
    query: 'INSERT INTO table1 VALUES (?, ?, ?)'
    parameters:
      - key: 1
        value: '@=input["value1"]'
      - key: 2
        value: '@=input["value2"]'
      - key: 3
        value: '@=input["value3"]'
  # ...
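Parameters are most useful when a query depends on the line currently being processed. The following is a minimal sketch of a parameterized lookup, assuming that parameters is accepted under lookup in the same way as under loader; the identifier input field and the :identifier parameter name are hypothetical and only serve the illustration.

```yaml
sql:
  lookup:
    # :identifier is bound from the current input line (hypothetical field name)
    query: 'SELECT * FROM table2 WHERE bar = :identifier'
    parameters:
      - key: identifier
        value: '@=input["identifier"]'
    merge:
      map:
        - field: '[options]'
          expression: 'lookup["name"]'
  connection:
    dsn: 'mysql:host=127.0.0.1;port=3306;dbname=kiboko'
    username: username
    password: password
```

Binding the value through a parameter rather than concatenating it into the query string lets PDO handle escaping.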
In some cases, you may need to run queries beforehand to prepare for the execution of your pipeline.
Before queries are executed before the query defined in the configuration. They are often used to set up the database.
sql:
  before:
    queries:
      - 'CREATE TABLE foo (id INTEGER NOT NULL, value VARCHAR(255) NOT NULL)'
      - 'INSERT INTO foo (id, value) VALUES (1, "Lorem ipsum dolor")'
      - 'INSERT INTO foo (id, value) VALUES (2, "Sit amet consecutir")'
  # ...
After queries are executed after the query defined in the configuration. They are often used to clean up the database.
sql:
  after:
    queries:
      - 'DROP TABLE foo'
  # ...
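To see how the pieces fit together, here is a sketch of a single configuration that combines before queries, a parameterized loader, after queries and a persistent connection. It assumes these sections can sit side by side under sql, as the # ... placeholders in the examples above suggest; the id and value input fields are hypothetical.

```yaml
sql:
  before:
    queries:
      # Runs first: create the table the loader writes to
      - 'CREATE TABLE foo (id INTEGER NOT NULL, value VARCHAR(255) NOT NULL)'
  loader:
    query: 'INSERT INTO foo (id, value) VALUES (:id, :value)'
    parameters:
      - key: id
        value: '@=input["id"]'
      - key: value
        value: '@=input["value"]'
  after:
    queries:
      # Runs last: clean up once the pipeline step is done
      - 'DROP TABLE foo'
  connection:
    dsn: 'mysql:host=127.0.0.1;port=3306;dbname=kiboko'
    username: username
    password: password
    options:
      persistent: true
```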