-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error in dataloader Tibetan file #414
Comments
Seems to have been resolved but I haven't checked all files. |
This is fixed for Tibetan but the problem remains for Sanskrit and Pali
(matches missing in table view). I will fix this by uploading new data
later today
…On Wed, Jan 8, 2025 at 7:20 AM Ven. Vimala ***@***.***> wrote:
Seems to have been resolved but I haven't checked all files.
—
Reply to this email directly, view it on GitHub
<#414 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AEPC7GCVWTKA3R5QEKYE6EL2JU65TAVCNFSM6AAAAABUT433FKVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKNZXHEZTQNJZGM>
.
You are receiving this because you were assigned.Message ID:
***@***.***>
|
I will reopen issue and leave it in the review section to check later. I hadn't actually noticed it for the Pali files I checked a few days ago. |
In theory at least the bug should apply there as well, but I didn’t do an
a/b comparison…
…On Wed, Jan 8, 2025 at 7:24 AM Ven. Vimala ***@***.***> wrote:
I will reopen issue and leave it in the review section to check later. I
hadn't actually noticed it for the Pali files I checked a few days ago.
—
Reply to this email directly, view it on GitHub
<#414 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AEPC7GEBDXARIXDFW5YJGLD2JU7MVAVCNFSM6AAAAABUT433FKVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKNZXHE2DQOBRG4>
.
You are receiving this because you were assigned.Message ID:
***@***.***>
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
In the Tibetan file BO_TZ9_TA-039 there is an error:
The number of matches shown in the top right is 221, yet there is no data. This is so in Table View and Graph View while Text View shows matches.
Looking at the database it looks like there is nothing listed in the parallels_sorted_file:
(Both Table View and Graph View depend on the parallels_sorted_file)
But there are parallels in the parallels DB collection:
So these 221 parallels should be shown in the parallesl_sorted_file.
It happens with many files but it is most visible on Terzo.
An other example for instance is: BO_NG_0001
Here the parallel_sorted_file lists this:
While there are actually 12673 total matches!!!
And why is the number of matches in the parallels_sorted_file different for those that are sorted by length???
I suspect that this error is there for ALL Tibetan texts but for texts that have a large number of matches it is not so clear to see because they give at least some matches in Table View and Graph View.
The text was updated successfully, but these errors were encountered: