Backups are broken since upgrade to 1.27 #4308
Comments
Hi @franco-martin would it be possible for you to use microk8s.dbctl backup -o backup? microk8s.dbctl is the expected way to take backups. |
Sorry, I missed that detail. I already tried that and it generates an empty tar.gz file. I linked another issue where all my tests were conducted. I have a backup you can use to replicate the problem; how can I send it to you?
|
Folks I understand you all have a lot of work to do, but please communicate if this is not a priority. Ghosting people that are reporting issues and willing to troubleshoot them is not the way to get more contributors. |
Hi @franco-martin sorry for the late reply. My main issue is that in my tests backups do not produce an empty file. Even if you sent me an empty tar.gz I would not be able to do anything with it. Can you help me reproduce this? Using VMs, can you walk me through the steps you follow that result in an empty tarball? Thank you. |
I have a backup that when imported into a cluster, breaks the backup
functionality. How do I send it privately?
|
Let's talk on the #microk8s channel over at the Kubernetes Slack. I am kjackal there. |
Updated the description with reproduction steps and a publicly available download link. Hopefully that'll speed things up. |
Hi @franco-martin, going through the steps you provided I was not able to get to a functioning cluster. |
I just tried it on a new node and it worked. What's failing on your side? I'm testing 1.28/stable; I went through it step by step and it works every time.
|
I just updated the description. I built a script that automates the process. I verified it in a brand new t3.medium instance on AWS and it reproduced. |
I'm also seeing this issue of empty backup files. They contain a single empty directory. My cluster is on 1.29, but I don't know when the problem started. |
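For anyone wanting to confirm the same symptom, a minimal Python sketch that checks whether a backup archive holds any regular files at all (as opposed to only an empty directory) might look like this. The function name `archive_has_files` and the archive path are illustrative assumptions, not part of microk8s.

```python
import tarfile


def archive_has_files(path):
    """Return True if the tar archive at `path` contains at least one regular file.

    Hypothetical helper for illustration; it is not part of microk8s.
    """
    # "r:*" lets tarfile detect the compression (gzip for a .tar.gz) itself.
    with tarfile.open(path, "r:*") as tar:
        # Directory entries do not count, so an archive holding only an
        # empty directory is reported as having no files.
        return any(member.isfile() for member in tar)
```

Calling `archive_has_files("backup.tar.gz")` on an affected backup would be expected to return `False`; the file name here is an assumption and should be replaced with whatever the backup command actually produced.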
Also seeing this issue, even with v1.29.2 (latest version of the 1.29/stable track). |
Hello everyone. Doesn't make a difference if we only use |
According to https://microk8s.io/docs/restore-quorum#stop-dqlite-on-all-nodes-1 it is required to stop all dqlite instances using |
It looks like you stumbled across the microk8s documentation that explains recovering a multi-node cluster after a failure with a lost quorum. The issue explained in this thread however deals with non-working backups using microk8s dbctl in general, even during normal cluster operation without any known issues. A healthy cluster shouldn't have to be stopped via As the author of this thread and many others mentioned (me included) the I just reproduced the problem with the following steps:
So far so good. Seems to be working with 1.26.14.
This seems to be exactly the problem that the author described above. |
Same issue; it does not work even with 1.30. The problem is that I cannot revert to an older version, because the cluster stops working when I do.
|
Hi, is there anything we users can do to diagnose the issue?
|
@sbidoul I tried a bunch of things, but without much success. What you could do is debug "https://github.com/canonical/microk8s/blob/master/scripts/wrappers/dbctl.py" with an existing cluster. I haven't had the time to do that, but if there's business interest in this, you might be able to convince a stakeholder to back you up in the name of open source. |
I traced a little bit until the kine No time to dig deeper for now... |
Summary
Backup consists of a 213-215 byte tar.gz file containing nothing. This started happening after upgrading to 1.27
Running
/snap/microk8s/current/bin/migrator --endpoint unix://${SNAP_DATA}/var/kubernetes/backend/kine.sock:12379 --mode backup-dqlite --db-dir ./
hangs for more than a day and never completes. I created another issue, #4259, but responses stopped 3 weeks ago.
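When reproducing the hang, it can help to bound the run time instead of waiting a day. The following is only a sketch: `run_with_timeout` and `migrator_backup_cmd` are hypothetical helper names, and the command line is simply the one quoted above with `SNAP_DATA` passed in as a parameter.

```python
import subprocess


def migrator_backup_cmd(snap_data, db_dir="./"):
    """Build the migrator command quoted in this issue (hypothetical helper)."""
    return [
        "/snap/microk8s/current/bin/migrator",
        "--endpoint",
        f"unix://{snap_data}/var/kubernetes/backend/kine.sock:12379",
        "--mode",
        "backup-dqlite",
        "--db-dir",
        db_dir,
    ]


def run_with_timeout(cmd, timeout_s):
    """Run cmd; return its exit code, or None if it ran longer than timeout_s seconds."""
    try:
        # subprocess.run kills the child process when the timeout expires
        # and then raises TimeoutExpired.
        return subprocess.run(cmd, timeout=timeout_s).returncode
    except subprocess.TimeoutExpired:
        return None
```

For example, `run_with_timeout(migrator_backup_cmd(os.environ["SNAP_DATA"]), 600)` (after `import os`) would return `None` if the migrator is still running after ten minutes; the ten-minute limit is an arbitrary choice.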
What Should Happen Instead?
The backup should work and should probably (statistically) be around 90 MB.
Reproduction Steps
bash ./script YOUR_IP
microk8s kubectl get nodes
You should have two nodes.
Introspection Report
inspection-report-20231019_220512.tar.gz
Can you suggest a fix?
Are you interested in contributing with a fix?
I'll help with whatever I can.