Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add storage support. #18

Merged
merged 1 commit into from
Jan 24, 2017
Merged

Add storage support. #18

merged 1 commit into from
Jan 24, 2017

Conversation

tkuhlman
Copy link
Contributor

The latest committed version of amulet and juju-deployer are needed for storage support in the tests.

Copy link
Collaborator

@freeekanayaka freeekanayaka left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have a comment about a bit I don't fully understand.

Also, can you please rebase on master? I think the public-url branch that landed has some overlap with this (according to the diff).

log("Setting Jenkins home to {}".format(home))
unitdata.kv().set("jenkins.home", home)
# re-import paths to force update
importlib.reload(paths)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't understand why this is needed. Where does this branch change global variables defined in the paths module?

Note also that the global variables defined in the paths module are meant to be constants, and should not be changed at run time. If you want something to be variable, than please make it a regular variable that is passed around.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The previous line (not counting the comment) changes the jenkins.home unitdata variable which is what many of the variables in paths are based on. Jenkins itself then has its home reset if needed with 'configuration.set_home()'. If the reload of paths isn't done then when any subsequent functions run they don't work correctly.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm failing to see how the jenkins.home unitdata variable and the global variables in the paths module are related. Where is the code that connects them?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That was a change I made also, https://github.com/jenkinsci/jenkins-charm/pull/18/files#diff-51fc11c4cd6e41d328a7e7d4f68cf801R8

The fact that this change is part of the diff just adds to the argument that these shouldn't be globals anymore.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I switched to moving the JENKINS_HOME rather than modifying the Jenkins config in the updated review. That removes this section.

@tkuhlman
Copy link
Contributor Author

This is based on master, I just checked and it says the branch is up to date. Where there specific lines you see as overlap?

Copy link
Collaborator

@freeekanayaka freeekanayaka left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I didn't dig too deep yet, but it seems to be that:

  1. The logic could be simplified and it could take more advantage of the mechanics of reactive hooks.

  2. Some of the logic could be moved from reactive.py to lib/ modules, opening the doors to better unit test coverage.

See the comments inline.

status_set("maintenance", "Adding Jenkins storage")
homedir = storage_get()['location']
unitdata.kv().set("storage.dir", homedir)
set_state("reset.jenkins.storage")
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Creating a new state and a new unitdata entry seems a bit convoluted and indirect.

Can't we just have something like:

@hook('jenkins-storage-detaching')
def detaching():
    status_set("maintenance", "Adding Jenkins storage")
    storage = Storage()
    storage.attach()

Where the "storage.attach()" call would perform more or less the same logic performed by the "reset_jenkins_storage" in this file.

The detaching hook could be implemented similarly.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes the unitdata entry was an artifact from my previous code you are right it isn't useful anymore. I'll remove it.

@@ -15,3 +17,39 @@
INITIAL_PASSWORD = os.path.join(SECRETS, "initialAdminPassword")
LAST_EXEC = os.path.join(HOME, "jenkins.install.InstallUtil.lastExecVersion")
LEGACY_BOOTSTRAP_FLAG = os.path.join(HOME, "config.bootstrapped")


class Storage(object):
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It feels this class should live in its own lib/charms/layer/jenkins/storage.py module. This module should be just be concerned with hard-wired path constants.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

remove_state("jenkins.bootstrapped")
bootstrap_jenkins()
else:
status_set('active', 'Ready')
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm a bit confused by this last bit. Can you clarify why this last bit of logic (the if/else block) is needed at all?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No need to trigger a bootstrap if it hasn't been done yet, the next Juju hook should trigger it, that is the else case. The status_set was simply to leave this hook in the default state again rather than have it still say, 'Configuring Jenkins Storage'

The if state is because bootstrap needs to be re-triggered to correctly setup Jenkins if had previously been run. This will be encountered if you add storage to an already running application.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hm, I now realize the issue a bit better. Essentially from the user perspective the Jenkins deployment would be reset from scratch in the case of "if get_state('jenkins.bootstrapped')", as we discussed on IRC, if I recall correctly.

I'm starting to think that if you don't have a concrete use case for attaching storage after initial deployment, I'd prefer that we ignore the attach/detach storage events completely for now, and perhaps set the status to some error code (e.g. "Attaching storage after deployment not supported").

Alternatively, I'd tend to change my mind and say that we should copy existing data to the new directory (and do that in a separate PR). Otherwise, the behavior is a bit surprising: as I user I prefer to be told that some action is required rather than having my Jenkins apparently reset (I'd have to know that the data has been saved in .bak).

Sorry for the back and forth, and feel free to ping me again on IRC.



@when("reset.jenkins.storage")
def reset_jenkins_storage():
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I believe most of the logic in this function could be moved to the Storage class, so you can have unit tests coverage for it.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Perhaps I'm missing something but I'm not sure how the unit tests will work with the remove_state and get_state calls and regardless since those are reactive states they seem like they should be in reactive/jenkins.py


jenkins_installed = get_state("apt.installed.jenkins")
if jenkins_installed:
service_stop('jenkins')
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can't we avoid this check by requiring this reactive state hook to be fired only once jenkins is installed?

For example

@when("jenkins-storage-attached", "apt.installed.jenkins")
def attach():
    # ...

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes you could but I realized that in the case where storage was specified on the charm deploy this hook will fire before installing Jenkins. As it is now when Jenkins installs it populates the storage directory immediately and is ready for use. If you wait until after Jenkins is installed you have to restart Jenkins to repopulate the dir and possibly have to redo bootstrapping. It is cleaner to do it correctly from the beginning rather than watching the install restart Jenkins repeatedly and repeat steps for each dir so I thought it worth putting in this if statement.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Indeed I think this issue is related to how we (don't) handle data migration in case storage is attached post-deployment. See also my reply to the other comment below.

If we had support for transparent data migration I think your concerns would be mitigated. It's true that we'd have to "unnecessarily" copy over a few files (that should be real fast) and restart the service, but this sounds a negligible cost from the user perspective (probably a few seconds altogether on regular hardware).

The benefit would be that we would consolidate code to have a single flow and simplify testing and maintenance. In these cases, I'd tend to prefer to pay the cost of a little overhead and get simpler code in return.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

And for clarity, in case we copy the data over we could still use the symlink, so there would be no need to rewrite files in /etc/default or anything.

remove_state("jenkins.bootstrapped")
bootstrap_jenkins()
else:
status_set('active', 'Ready')
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hm, I now realize the issue a bit better. Essentially from the user perspective the Jenkins deployment would be reset from scratch in the case of "if get_state('jenkins.bootstrapped')", as we discussed on IRC, if I recall correctly.

I'm starting to think that if you don't have a concrete use case for attaching storage after initial deployment, I'd prefer that we ignore the attach/detach storage events completely for now, and perhaps set the status to some error code (e.g. "Attaching storage after deployment not supported").

Alternatively, I'd tend to change my mind and say that we should copy existing data to the new directory (and do that in a separate PR). Otherwise, the behavior is a bit surprising: as I user I prefer to be told that some action is required rather than having my Jenkins apparently reset (I'd have to know that the data has been saved in .bak).

Sorry for the back and forth, and feel free to ping me again on IRC.


jenkins_installed = get_state("apt.installed.jenkins")
if jenkins_installed:
service_stop('jenkins')
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Indeed I think this issue is related to how we (don't) handle data migration in case storage is attached post-deployment. See also my reply to the other comment below.

If we had support for transparent data migration I think your concerns would be mitigated. It's true that we'd have to "unnecessarily" copy over a few files (that should be real fast) and restart the service, but this sounds a negligible cost from the user perspective (probably a few seconds altogether on regular hardware).

The benefit would be that we would consolidate code to have a single flow and simplify testing and maintenance. In these cases, I'd tend to prefer to pay the cost of a little overhead and get simpler code in return.

@tkuhlman
Copy link
Contributor Author

Transparent data migration is a lot harder then it sounds because one of the reasons for adding storage is because it isn't a few files but a multi-gigabyte footprint. Also on application removal the storage is detached so you have to figure out the difference between a true detach of storage and just wanting to shut it all down.

You are right that I am not notifying the user at all that their data is on disk and you can manually sync, there is no real great mechanism to do so. Also though I can think of reasons to add storage after deployment I don't personally need that. Considering those I could drop support for adding storage after installation. Regardless detaching is still needed because that can trigger as part of destroying an application.

However even dropping post-deploy storage support isn't quite so simple by the time the charm gets the storage attach hook Juju has already allocated and attached the storage to the vm. If I fail at that point I have just put the charm into an unrecoverable failure state. If I just log how is the user to know that they now have storage attached that is a waste? The best is probably to fail, force the user to run juju resolved --no-retry followed by detaching the storage and then making the charm detach hook a noop.

I prefer just better docs on the limitations to attaching storage to a deployed application.

@tkuhlman
Copy link
Contributor Author

I updated the code to more smoothly handle upgrading of the charm from a non-storage version to a storage version.

Also note a new version of juju-deployer with storage support is now available in pypi so only amulet still needs to be built from source to support storage in the amulet tests.

@freeekanayaka freeekanayaka merged commit 1d60e44 into jenkinsci:master Jan 24, 2017
@ryan-beisner
Copy link
Contributor

Has anyone tried to use the updated charm without storage? I'm not succeeding in that. #34

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants