Skip to content

Commit

Permalink
Merge branch 'main' into idtoken
Browse files Browse the repository at this point in the history
  • Loading branch information
minrk authored Feb 13, 2024
2 parents 26ea6fa + 8dfe4f4 commit 18f2d19
Show file tree
Hide file tree
Showing 23 changed files with 280 additions and 54 deletions.
4 changes: 2 additions & 2 deletions .pre-commit-config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -34,7 +34,7 @@ repos:

# Autoformat: Python code
- repo: https://github.com/psf/black
rev: 23.12.1
rev: 24.1.1
hooks:
- id: black

Expand Down Expand Up @@ -64,7 +64,7 @@ repos:

# Lint: Python code
- repo: https://github.com/pycqa/flake8
rev: "6.1.0"
rev: "7.0.0"
hooks:
- id: flake8

Expand Down
1 change: 1 addition & 0 deletions docs/source/how-to/example-oauthenticator.py
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
"""
Example OAuthenticator to use with My Service
"""

from jupyterhub.auth import LocalAuthenticator

from oauthenticator.oauth2 import OAuthenticator, OAuthLoginHandler
Expand Down
1 change: 1 addition & 0 deletions docs/source/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -47,6 +47,7 @@ Topic guides go more in-depth on a particular topic.
:maxdepth: 2
:caption: Topic guides
topic/allowing
topic/extending
```

Expand Down
142 changes: 142 additions & 0 deletions docs/source/topic/allowing.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,142 @@
(allowing)=

# Allowing access to your JupyterHub

OAuthenticator is about deferring **authentication** to an external source,
assuming your users all have accounts _somewhere_.
But many of these sources (e.g. Google, GitHub) have _lots_ of users, and you don't want _all_ of them to be able to use your hub.
This is where **authorization** comes in.

In OAuthenticator, authorization is represented via configuration options that start with `allow` or `block`.

There are also lots of OAuth providers, and as a result, lots of ways to tell OAuthenticator who should be allowed to access your hub.

## Default behavior: nobody is allowed!

The default behavior of OAuthenticator (starting with version 16) is to block all users unless explicitly authorized via _some_ `allow` configuration.
If you want anyone to be able to use your hub, you must specify at least one `allow` configuration.

```{versionchanged} 16
Prior to OAuthenticator 16, `allow_all` was _implied_ if no other `allow` configuration was specified.
Starting from 16, `allow_all` can only be enabled explicitly.
```

## Allowing access

There are several `allow_` configuration options, to grant access to users according to different rules.

When you have only one `allow` configuration, the behavior is generally unambiguous: anyone allowed by the rule can login to the Hub, while anyone not explicitly allowed cannot login.
However, once you start adding additional `allow` configuration, there is some ambiguity in how multiple rules are combined.

```{important}
Additional allow rules **can only grant access**, meaning they only _expand_ who has access to your hub.
Adding an `allow` rule cannot prevent access granted by another `allow` rule.
To block access, use `block` configuration.
```

That is, if a user is granted access by _any_ `allow` configuration, they are allowed.
An allow rule cannot _exclude_ access granted by another `allow` rule.

An example:

```python
c.GitHubOAuthenticator.allowed_users = {"mensah", "art"}
c.GitHubOAuthenticator.allowed_organizations = {"preservation"}
```

means that the users `mensah` and `art` are allowed, _and_ any member of the `preservation` organization are allowed.
Any user that doesn't meet any of the allow rules will not be allowed.

| user | allowed | reason |
| ----- | ------- | ------------------------------------------------------- |
| art | True | in `allowed_users` |
| amena | True | member of `preservation` |
| tlacy | False | not in `allowed_users` and not member of `preservation` |

### `allow_all`

The first and simplest way to allow access is to any user who can successfully authenticate:

```python
c.OAuthenticator.allow_all = True
```

This is appropriate when you use an authentication provider (e.g. an institutional single-sign-on provider), where everyone who has an account in the provider should have access to your Hub.
It may also be appropriate for unadvertised short-lived hubs, e.g. dedicated hubs for workshops that will be shutdown after a day, where you may decide it is acceptable to allow anyone who finds your hub to login.

If `allow_all` is enabled, no other `allow` configuration will have any effect.

```{seealso}
Configuration documentation for {attr}`.OAuthenticator.allow_all`
```

### `allowed_users`

This is top-level JupyterHub configuration, shared by all Authenticators.
This specifies a list of users that are allowed by name.
This is the simplest authorization mechanism when you have a small group of users whose usernames you know:

```python
c.OAuthenticator.allowed_users = {"mensah", "ratthi"}
```

If this is your only configuration, only these users will be allowed, no others.

Note that any additional usernames in the deprecated `admin_users` configuration will also be allowed to login.

```{seealso}
Configuration documentation for {attr}`.OAuthenticator.allowed_users`
```

### `allow_existing_users`

JupyterHub can allow you to add and remove users while the Hub is running via the admin page.
If you add or remove users this way, they will be added to the JupyterHub database, but they will not be able to login unless they are also granted access via an `allow` rule.

To enable managing users via the admin panel, set

```python
c.OAuthenticator.allow_existing_users = True
```

```{warning}
Enabling `allow_existing_users` means that _removing_ users from any explicit allow mechanisms will no longer revoke their access.
Once the user has been added to the database, the only way to revoke their access to the hub is to remove the user from JupyterHub entirely, via the admin page.
```

```{seealso}
Configuration documentation for {attr}`.OAuthenticator.allow_existing_users`
```

### Provider-specific rules

Each OAuthenticator provider may have its own provider-specific rules to allow groups of users access, such as:

- {attr}`.GitHubOAuthenticator.allowed_organizations`
- {attr}`.GitLabOAuthenticator.allowed_gitlab_groups`
- {attr}`.GlobusOAuthenticator.allowed_globus_groups`
- {attr}`.GoogleOAuthenticator.allowed_google_groups`

## Blocking Access

It's possible that you want to limit who has access to your Hub to less than all of the users granted access by your `allow` configuration.
`block` configuration always has higher priority than `allow` configuration, so if a user is both allowed _and_ blocked, they will not be able to login.

The only `block` configuration is the base Authenticators `block_users`,
a set of usernames that will not be allowed to login.

### Revoking previously-allowed access

Any users who have logged in previously will be present in the JupyterHub database.
Removing a user's login permissions (e.g. removing them from a GitLab project when using {attr}`.GitLabOAuthenticator.project_ids`) only prevents future logins;
it does not remove the user from the JupyterHub database.
This means that:

1. any API tokens that the user still has access to will continue to be valid, and can continue to be used
2. any still-valid browser sessions will continue to be logged in.

```{important}
To fully remove a user's access to JupyterHub,
their login permission must be revoked _and_ their user fully deleted from the Hub,
e.g. via the admin page.
```
1 change: 1 addition & 0 deletions examples/auth_state/jupyterhub_config.py
Original file line number Diff line number Diff line change
Expand Up @@ -5,6 +5,7 @@
2. pass select auth_state to Spawner via environment variables
3. enable auth_state via `JUPYTERHUB_CRYPT_KEY` and `enable_auth_state = True`
"""

import os
import pprint
import warnings
Expand Down
21 changes: 21 additions & 0 deletions examples/mock-provider/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
# Generic OAuth with mock provider

This example uses [mock-oauth2-server] to launch a standalone local OAuth2 provider and configures GenericOAuthenticator to use it.

mock-auth2-server implements OpenID Connect (OIDC), and can be used to test GenericOAuthenticator configurations for use with OIDC providers without needing to register your application with a real OAuth provider.

[mock-oauth2-server]: https://github.com/navikt/mock-oauth2-server

To launch the oauth provider in a container:

```
docker run --rm -it -p 127.0.0.1:8080:8080 ghcr.io/navikt/mock-oauth2-server:2.1.1
```

Then launch JupyterHub:

```
jupyterhub
```

When you login, you will be presented with a form allowing you to specify the username, and (optionally) any additional fields that should be present in the `userinfo` response.
27 changes: 27 additions & 0 deletions examples/mock-provider/jupyterhub_config.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,27 @@
c = get_config() # noqa

c.JupyterHub.authenticator_class = "generic-oauth"

# assumes oauth provider run with:
# docker run --rm -it -p 127.0.0.1:8080:8080 ghcr.io/navikt/mock-oauth2-server:2.1.1

provider = "http://127.0.0.1:8080/default"
c.GenericOAuthenticator.authorize_url = f"{provider}/authorize"
c.GenericOAuthenticator.token_url = f"{provider}/token"
c.GenericOAuthenticator.userdata_url = f"{provider}/userinfo"
c.GenericOAuthenticator.scope = ["openid", "somescope", "otherscope"]

# these are the defaults. They can be configured at http://localhost:8080/default/debugger
c.GenericOAuthenticator.client_id = "debugger"
c.GenericOAuthenticator.client_secret = "someSecret"

# 'sub' is the first field in the login form
c.GenericOAuthenticator.username_claim = "sub"

c.GenericOAuthenticator.allow_all = True
c.GenericOAuthenticator.admin_users = {"admin"}

# demo boilerplate
c.JupyterHub.default_url = "/hub/home"
c.JupyterHub.spawner_class = "simple"
c.JupyterHub.ip = "127.0.0.1"
1 change: 1 addition & 0 deletions oauthenticator/auth0.py
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
"""
A JupyterHub authenticator class for use with Auth0 as an identity provider.
"""

import os

from jupyterhub.auth import LocalAuthenticator
Expand Down
1 change: 1 addition & 0 deletions oauthenticator/azuread.py
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
"""
A JupyterHub authenticator class for use with Azure AD as an identity provider.
"""

import os

import jwt
Expand Down
1 change: 1 addition & 0 deletions oauthenticator/bitbucket.py
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
"""
A JupyterHub authenticator class for use with Bitbucket as an identity provider.
"""

import os

from jupyterhub.auth import LocalAuthenticator
Expand Down
1 change: 1 addition & 0 deletions oauthenticator/cilogon.py
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
"""
A JupyterHub authenticator class for use with CILogon as an identity provider.
"""

import os
from fnmatch import fnmatch
from urllib.parse import urlparse
Expand Down
26 changes: 1 addition & 25 deletions oauthenticator/generic.py
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
"""
A JupyterHub authenticator class for use with any OAuth2 based identity provider.
"""

import os
from functools import reduce

Expand Down Expand Up @@ -60,20 +61,6 @@ def _login_service_default(self):
""",
)

username_claim = Union(
[Unicode(os.environ.get('OAUTH2_USERNAME_KEY', 'username')), Callable()],
config=True,
help="""
When `userdata_url` returns a json response, the username will be taken
from this key.
Can be a string key name or a callable that accepts the returned
userdata json (as a dict) and returns the username. The callable is
useful e.g. for extracting the username from a nested object in the
response.
""",
)

@default("http_client")
def _default_http_client(self):
return AsyncHTTPClient(
Expand Down Expand Up @@ -113,17 +100,6 @@ def _default_http_client(self):
""",
)

def user_info_to_username(self, user_info):
"""
Overrides OAuthenticator.user_info_to_username to support the
GenericOAuthenticator unique feature of allowing username_claim to be a
callable function.
"""
if callable(self.username_claim):
return self.username_claim(user_info)
else:
return super().user_info_to_username(user_info)

def get_user_groups(self, user_info):
"""
Returns a set of groups the user belongs to based on claim_groups_key
Expand Down
1 change: 1 addition & 0 deletions oauthenticator/github.py
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
"""
A JupyterHub authenticator class for use with GitHub as an identity provider.
"""

import json
import os
import warnings
Expand Down
1 change: 1 addition & 0 deletions oauthenticator/gitlab.py
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
"""
A JupyterHub authenticator class for use with GitLab as an identity provider.
"""

import os
import warnings
from urllib.parse import quote
Expand Down
1 change: 1 addition & 0 deletions oauthenticator/globus.py
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
"""
A JupyterHub authenticator class for use with Globus as an identity provider.
"""

import base64
import os
import pickle
Expand Down
1 change: 1 addition & 0 deletions oauthenticator/google.py
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
"""
A JupyterHub authenticator class for use with Google as an identity provider.
"""

import os

from jupyterhub.auth import LocalAuthenticator
Expand Down
1 change: 1 addition & 0 deletions oauthenticator/mediawiki.py
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
"""
A JupyterHub authenticator class for use with MediaWiki as an identity provider.
"""

import json
import os
from asyncio import wrap_future
Expand Down
Loading

0 comments on commit 18f2d19

Please sign in to comment.