-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Setup Nightly Backup Job #1
Comments
John, These issue tickets look like they will be super useful. Glad we could have lunch today! Thanks again for all your help. -Charles On Aug 30, 2013, at 10:09 PM, jdimatteo [email protected] wrote:
|
I'm testing out an Amanda configuration of Amanda, and I'll probably set it up on Tuesday on TOD. $ sudo apt-get install amanda-server amanda-client I'm probably going to set up a dedicated gmail account for tod sys admin, so that this account is used for email updates for backups (e.g. to notify if a backup didn't occur) and this could also be used by Jenkins (which is currently just using my personal email account). |
Thanks John, Am reading up on AMANDA now. Sounds interesting, but I'm worried how hard http://rdiff-backup.nongnu.org/ -Charles Charles Y. Lin, Ph.D. On Sun, Sep 22, 2013 at 11:52 PM, jdimatteo [email protected]:
|
So crashplan won't work well with network drives, but amazon glacier works. My buddies are using rdiff-backup to make constant snapshots for storage on Ideally, we should have weekly snapshots backed up to crusader and then -Charles Charles Y. Lin, Ph.D. On Mon, Sep 23, 2013 at 12:22 PM, Charles Lin [email protected]:
|
During the Amanda install, I was prompted for postfix configuration. However, afterwards I installed ssmtp which automatically removed postfix: I created a new gmail account [email protected], and followed the instructions at http://askubuntu.com/a/91857 :
Now system emails are sent using the [email protected] account, e.g.
|
I haven't used rdiff-backup, but it looks a lot simpler than Amanda, so it should be easier to maintain in the long run (especially if others at Dana-Farber are using it as well). I'm going to experiment with rdiff-backup and duplicity tomorrow. |
Charles / @bradnerComputation : I'm going to go ahead and setup rdiff-backup on TOD. You mentioned a mysql database on /ark -- does that exist already? If so, please let me know the database name(s) that should be backed up. Please note that rdiff-backup doesn't support a full system backup, so I'm just going to backup the 4 directories you mentioned earlier (/ark, /raider, /mnt/d0-0/share/bradnerlab/, and /ifs/labs/bradner/). I expect the initial backup to take a while to transfer over the network (possibly days, depending on the local network speed). The initial backup will use a lot of local network bandwidth -- please let me know if there is any concern over using too much bandwidth and if I should throttle the transfer (I suggest we just let the backup go un-throttled and only throttle it if someone complains). Also, please let me know if you plan to restart TOD before the backups complete (I'll let you know when it is complete). After the initial backup is complete, subsequent backups should be fast and use little bandwidth (since they will be incremental backups, only copying over the changes). I'm going to create a Jenkins Backup job to run automatic weekly backups (probably to run on Saturdays at 3 AM). (You can also manually trigger a backup if you like by clicking the build button for the Backup job.) If any of these backups fail, an email will be sent to both you and I. I'm going to configure Jenkins security to make the backup jobs only visible/runnable by you and I, and only configurable by the bradneradmin user. On a github wiki page, I'm going to document the backup configuration and detailed instructions (with examples) of how to restore from backup. After I get the rdiff-backups running, configured, and documented I'll start working on evaluating/planning duplicity backups to Amazon Glacier. Please let me know if you'd like me to hold off on any of this, or if you have any concerns or questions. |
John, This sounds good.
This looks great, thanks again for your help! |
Charles, I'm going to go ahead and start the backups tonight. Please let me know if you'd like me to hold off, if you setup the database yet, and/or any TOD/network/crusader downtime is planned. Sent from my iPhone On Sep 24, 2013, at 10:06 AM, bradnerComputation [email protected] wrote:
|
Thanks Jon. Let's only back up /ark at first to make sure everything goes well. We have just recently created a new MySQL database called seqDB. This should be stored on /ark. -Charles
|
I installed rdiff-backup:
But the crusader mount doesn't have fully functioning file permissions, e.g. when I touch a file I get an error and it shows the wrong user owner:
Looking through the bradneradmin .bash_history, it looks like crusader was mounted like this:
It looks like maybe the permissions aren't (as much of) an issue for root:
In a screen session, the ark backup has started:
|
The backed resulted in many errors such as the following, so I aborted the backup:
I think this error is related to the crusader CIFS deficiencies with rdiff-backup, e.g. as described here: http://rdiff-backup.nongnu.org/FAQ.html#cifs |
Charles / @bradnerComputation : is it possible to mount crusader using something besides CIFS/SAMBA? For example, is it possible to mount it as NFS? This would make both backups and restores simpler. I'm going to hold off on resuming the backup until I hear back from you. I could probably get CIFS working, but it looks like it might be a headache according to the rdiff-backup documentation. According to http://rdiff-backup.nongnu.org/FAQ.html#cifs , "Using a CIFS or smbfs mount as the mirror directory has been troublesome for some users because of the wide variety of Samba configurations." Please feel free to forward me the make/model of the storage array and I can help evaluate any options besides CIFS. Thanks |
John, NFS mounting of crusader is easy. I'll try to get that working monday. The Do you think NFS mounting crusader will solve these problems? Charles Y. Lin, Ph.D. On Sat, Sep 28, 2013 at 1:21 AM, jdimatteo [email protected] wrote:
|
Charles, yes, I think NFS mounting will solve these problems. The rdiff-backup documentation suggests that CIFS/Samba is only a problem on the backup mirror (i.e. /crusader), but the the files to backup (i.e. /ark) are fine on CIFS/Samba. Sent from my iPhone On Sep 28, 2013, at 10:47 AM, bradnerComputation [email protected] wrote:
|
John, Through much brute strength and ignorance, I think I was able to nfs mount crusader. Have a look at the /etc/fstab to see if I did it correctly. Also, the way i have the system set up now, different users are mounting through their smb accounts the same drive. For instance /ark/home/af661/ressrv19/projects/ is the same place as /ark/home/cl512/ressrv19/projects/ Will this be problematic w/ rdiff and cause multiple copies? If so, we can target the backup more carefully. -Charles |
Hi Charles, The ark backup completed. It was only about 12 GB, so it just took a few minutes. I included the "--exclude-other-filesystems" option to rdiff-backup, so none of the ressrv19/projects/ mounts were included in the backup. Do any of the mounts under /ark/ (e.g. /ark/home/af661/ressrv19/projects/) need to be backed up to /crusader? I included some notes below. I'll setup a Jenkins job to run this backup every Friday, along with backing up the other directories we discussed. I'll document the backup/restore procedure once it is configured in Jenkins. Regards, Notes
Simulating loosing a file:
Restore the file:
Verify restore occurred properly:
The /crusader NFS permissions still seem a little screwy (all users are mapped to user nobody and group nogroup). rdiff-backup handles this nicely though, and restored files have the correct users. So I guess the /crusader mount configuration is OK. |
@bradnerComputation / Charles: The backup to /crusader is configured to run automatically every Saturday. I documented the configuration here (including a list of directories/databases being backed up and where they are being backed up to): https://github.com/BradnerLab/SystemAdmin/wiki#backups The initial backup is still running, and should complete later today. A few questions/comments:
I plan on setting up the offsite backups separately next week (or perhaps later) with #3 . PS |
I discussed with Charles on the phone on Monday, and based on that I made the following changes:
|
run backups at 3 AM
ask Charles:
initially just setup an rsync job triggered by cron or jenkins
document how the backups are done and how to do a restore. probably setup a wiki on this SystemAdmin repo
look into why we are backing up files, how important it is that the backups are available, and whether a simple rsync is really the best plan forward
The text was updated successfully, but these errors were encountered: