Developers should read the development section of the website, which covers thing like development philosophy and contribution process.
More information about the writing and building the documentation can be found in the docs module.
When writing a Git commit message, follow these guidelines.
Pull requests are usually merged into master
using the rebase and merge
strategy.
A typical pull request should strive to contain a single logical change (but not necessarily a single commit). Unrelated changes should generally be extracted into their own PRs.
If a pull request contains a stack of more than one commit, then popping any number of commits from the top of the stack, should not break the PR, ie. every commit should build and pass all tests.
Commit messages and history are important as well, because they are used by other developers to keep track of the motivation behind changes. Keep logical diffs grouped together in separate commits and order commits in a way that explains by itself the evolution of the change. Rewriting and reordering commits is a natural part of the review process. Mechanical changes like refactoring, renaming, removing duplication, extracting helper methods, static imports should be kept separated from logical and functional changes like adding a new feature or modifying code behaviour. This makes reviewing the code much easier and reduces the chance of introducing unintended changes in behavior.
Whenever in doubt on splitting a change into a separate commit, ask yourself the following question: if all other work in the PR needs to be reverted after merging to master for some objective reason (eg. a bug has been discovered), is it worth keeping that commit still in master.
We recommend you use IntelliJ as your IDE. The code style template for the project can be found in the codestyle repository along with our general programming and Java guidelines.
To run checkstyle and other maven checks before opening a PR: ./mvnw validate
In addition to those you should also adhere to the following:
The purpose of code style rules is to maintain code readability and developer efficiency when working with the code. All the code style rules explained below are good guidelines to follow but there may be exceptional situations where we purposefully depart from them. When readability and code style rule are at odds, the readability is more important.
Keep code consistent with surrounding code where possible.
Alphabetize sections in the documentation source files (both in the table of contents files and other regular documentation files).
When appropriate, use the stream API. However, note that the stream implementation does not perform well so avoid using it in inner loops or otherwise performance sensitive sections.
Categorize errors when throwing exceptions. For example, TrinoException
takes
an error code as an argument, TrinoException(HIVE_TOO_MANY_OPEN_PARTITIONS)
.
This categorization lets you generate reports so you can monitor the frequency
of various failures.
Ensure that all files have the appropriate license header; you can generate the
license by running mvn license:format
.
Consider using String formatting (printf style formatting using the Java
Formatter
class): format("Session property %s is invalid: %s", name, value)
(note that format()
should always be statically imported). Sometimes, if you
only need to append something, consider using the +
operator. Please avoid
format()
or concatenation in performance critical sections of code.
Avoid using the ternary operator except for trivial expressions.
It is suggested to declare members in private inner classes as public if they are part of the class API.
Do not use mocking libraries. These libraries encourage testing specific call sequences, interactions, and other internal behavior, which we believe leads to fragile tests. They also make it possible to mock complex interfaces or classes, which hides the fact that these classes are not (easily) testable. We prefer to write mocks by hand, which forces code to be written in a certain testable style.
Prefer AssertJ for complex assertions.
For thing not easily expressible with AssertJ, use Airlift's Assertions
class
if there is one that covers your case.
Using var
is discouraged.
Prefer using immutable collections from Guava over unmodifiable collections from JDK. The main motivation behind this is deterministic iteration.
Maintain the same quality for production and test code.
Please avoid abbreviations, slang or inside jokes as this makes harder for
non-native english speaker to understand the code. Very well known
abbreviations like max
or min
and ones already very commonly used across
the code base like ttl
are allowed and encouraged.
Avoid using the default
clause when the switch statement is meant to cover all
the enum values. Handling the unknown option case after the switch statement
allows static code analysis tools (e.g. Error Prone's MissingCasesInEnumSwitch
check) report a problem when the enum definition is updated but the code using
it is not.
There are several plugins in place to keep pom.xml clean. Your build may fail if:
- dependencies or XML elements are not ordered correctly
- overall pom.xml structure is not correct
Many such errors may be fixed automatically by running the following:
./mvnw sortpom:sort
When using IntelliJ to develop Trino, we recommend starting with all of the default inspections, with some modifications.
Enable the following inspections:
Java | Internationalization | Implicit platform default charset
,Java | Control flow issues | Redundant 'else'
(includingReport when there are no more statements after the 'if' statement
option),Java | Class structure | Utility class is not 'final'
,Java | Class structure | Utility class with 'public' constructor
,Java | Class structure | Utility class without 'private' constructor
.
Disable the following inspections:
Java | Performance | Call to 'Arrays.asList()' with too few arguments
,Java | Abstraction issues | 'Optional' used as field or parameter type
,Java | Data flow | Boolean method is always inverted
.
Update the following inspections:
- Remove
com.google.common.annotations.Beta
fromJVM languages | Unstable API usage
.
Enable errorprone (Error Prone Installation#IDEA):
- Install
Error Prone Compiler
plugin from marketplace, - Check the
errorprone-compiler
profile in the Maven tab
This should be enough - IDEA should automatically copy the compiler options from the POMs to each module. If that doesn't work, you can do it manually:
- In
Java Compiler
tab, selectJavac with error-prone
as the compiler, - Update
Additional command line parameters
and copy the contents ofcompilerArgs
in the top-level POM (except for-Xplugin:ErrorProne
) there- Remove the XML comments...
- ...except the ones which denote checks which fail in IDEA, which you should "unwrap"
- Remove everything from the list under
Override compiler parameters per-module
Note that the version of errorprone used by the IDEA plugin might be older than
the one configured in the pom.xml
and you might need to disable some checks
that are not yet supported by that older version. When in doubt, always check
with the full Maven build (./mvnw clean install -DskipTests -Perrorprone-compiler
).
In order to enable language injection inside Intellij IDEA, some code elements
can be annotated with the @org.intellij.lang.annotations.Language
annotation.
To make it useful, we recommend:
- Set the project-wide SQL dialect in
Languages & Frameworks | SQL Dialects
"Generic SQL" is a decent choice here, - Disable inspection
SQL | No data source configured
, - Optionally disable inspection
Language injection | Language mismatch
.
Even if the IDE does not support language injection this annotation is useful
for documenting the API's intent. Considering the above, we recommend annotating
with @Language
:
- All API parameters which are expecting to take a
String
containing an SQL statement (or any other language, like regular expressions), - Local variables which otherwise would not be properly recognized by IDE for language injection.
The Trino Web UI is composed of several React components and is written in JSX
and ES6. This source code is compiled and packaged into browser-compatible
Javascript, which is then checked in to the Trino source code (in the dist
folder). You must have Node.js and
Yarn installed to execute these commands. To update
this folder after making changes, simply run:
yarn --cwd core/trino-main/src/main/resources/webapp/src install
If no Javascript dependencies have changed (i.e., no changes to package.json
),
it is faster to run:
yarn --cwd core/trino-main/src/main/resources/webapp/src run package
To simplify iteration, you can also run in watch
mode, which automatically
re-compiles when changes to source files are detected:
yarn --cwd core/trino-main/src/main/resources/webapp/src run watch
To iterate quickly, simply re-build the project in IntelliJ after packaging is complete. Project resources will be hot-reloaded and changes are reflected on browser refresh.
Trino aims for frequent releases, generally once per week. This is a goal but not a guarantee, as critical bugs may lead to a release being pushed back or require an extra emergency release to patch the issue.
At the start of each release cycle, a GitHub issue is filed and pinned to track all necessary release notes. For example, see the issue for Trino 395. In addition, a release notes pull request is updated and maintained throughout the week, tracking all merged commits to ensure every change is properly documented and noted. This uses the release note template, with changes in each section arranged to have new features first, performance improvements second, and bugfixes third. See the release notes for 395 as an example.
Once it is time to release, the release process is kicked off. A code freeze is announced on the Trino Slack in the #releases channel, and then a maintainer utilizes the release scripts to update Trino to the next version.