feat: Convert the BMW encoding to JSON #16

cindyli · 2023-08-03T17:49:16Z

Description

This pull request converts the BMW encoding to a JSON file to be used for the future development.

Steps to test

Refer to the document Convert BMW encoding to JSON about steps to convert.

Additional information

Due to the copyright concern, the original BMW encoding files are not included in this pull request.

amb26 · 2023-08-03T18:27:13Z

docs/ConvertBMWToJSON.md

+that will serve as the foundation for implementing the BMW input method.
+
+BMW encoding documents are in PDF format. These PDFs are composed by digitalized images of orginal
+books. The coversio method is:


Typo coversio -> conversion -
Also "digitized" is better rendering of "digitalized"

amb26 · 2023-08-03T18:27:43Z

docs/ConvertBMWToJSON.md

+books. The coversio method is:
+
+1. Split every single page in a PDF into .jpg files
+2. Use OCR library to extract texts from .jpg files


Do we have a working OCR library for this? Could we provide more detailed instructions?

amb26 · 2023-08-03T18:28:26Z

docs/ConvertBMWToJSON.md

+BMW encoding documents are in PDF format. These PDFs are composed by digitalized images of orginal
+books. The coversio method is:
+
+1. Split every single page in a PDF into .jpg files


Split each page in the PDF into its own .jpg file

amb26 · 2023-08-03T18:29:28Z

utils/README.md

+
+**File formats**
+
+1. The content of any .txt file in the `source_txt_path` directory


Sample content of a .txt file in the ...

cindyli · 2023-08-03T19:02:15Z

Thanks for the review, @amb26. All addressed and ready for another round.

fix: adding missing bci-av-ids in ../data/bmw.json

CLAassistant · 2023-09-19T13:34:01Z

All committers have signed the CLA.

…into feat/bmw

fix: making some fixes and adding missing bci-av-ids

cindyli added 5 commits June 7, 2023 14:33

feat/OCR: extract english texts from images

47c0e66

chore: update documentation

a2c9806

Merge branch 'main' into feat/ocr

ce82484

feat: convert the BMW encoding to JSON

0cc23f1

fix: linted

9fc6571

amb26 reviewed Aug 3, 2023

View reviewed changes

fix: address review comments

440e849

cindyli and others added 8 commits August 3, 2023 19:06

fix: improve the conversion of the BMW encoding

d43b69c

fix: clean up

9ca7466

feat: add data files as the copyright restriction is lifted

6e0c6d0

fix: update the latest bmw.json

22d0ad5

fix: update the latest bmw.json

1b439a1

fix: use spacy to fill in null BCI-AV-IDs in bmw.json

c42b51d

adding missing bci-av-ids

aa20f46

Merge pull request #1 from hlridge/feature/add-missing-bci-av-ids

c66ba30

fix: adding missing bci-av-ids in ../data/bmw.json

cindyli and others added 10 commits September 19, 2023 14:51

fix: fill up IDs for ordinal numbers in bmw.json

b33b9d5

fix: add BCI-AV-ID for "ten" in bmw.json

d264ac1

fix: repopulate the "encoding_symbols" section in bmw.json

4d21c2d

fix: add "mind" in noun form into bmw.json

f2434a0

adding missing bci-av-ids

f79723c

fix: add script for populating "encoding_symbols" section in bmw.json

3721542

fix: linted

89aeb53

Merge remote-tracking branch 'hannes/feature/add-missing-bci-av-ids' …

2f84017

…into feat/bmw

fix: add missing encodings in bmw.json

aa6c379

fix: remove "You+" from "encoding_symbols" section in bmw.json

1208ffa

fix: add script to find missing encodings

d9f0557

cindyli mentioned this pull request Sep 22, 2023

Feat: Create the keys json file for rendering BMW palette #17

Open

hlridge and others added 8 commits September 25, 2023 02:05

making some fixes and adding missing bci-av-ids

0249fcc

Merge pull request #3 from hlridge/feature/add-missing-bci-av-ids

4f32cd3

fix: making some fixes and adding missing bci-av-ids

fix: update the id for "Touch Talkers" in bwm.json

754fadc

fix: re-populate "encoding_symbols" section in bmw.json

25db169

fix: apply accomodating bliss to SVO messages in bmw.json

9c9eada

fix: add indicators for verb tenses

8ad7cbe

fix: repopulate the "encoding_symbols" section in bmw.json

61e171a

fix: more fixes in bmw.json; improve documentation

663afb4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Convert the BMW encoding to JSON #16

feat: Convert the BMW encoding to JSON #16

cindyli commented Aug 3, 2023

amb26 Aug 3, 2023

amb26 Aug 3, 2023

amb26 Aug 3, 2023

amb26 Aug 3, 2023

cindyli commented Aug 3, 2023

CLAassistant commented Sep 19, 2023 •

edited

Loading


		File formats

		1. The content of any .txt file in the `source_txt_path` directory

feat: Convert the BMW encoding to JSON #16

Are you sure you want to change the base?

feat: Convert the BMW encoding to JSON #16

Conversation

cindyli commented Aug 3, 2023

Description

Steps to test

Additional information

amb26 Aug 3, 2023

Choose a reason for hiding this comment

amb26 Aug 3, 2023

Choose a reason for hiding this comment

amb26 Aug 3, 2023

Choose a reason for hiding this comment

amb26 Aug 3, 2023

Choose a reason for hiding this comment

cindyli commented Aug 3, 2023

CLAassistant commented Sep 19, 2023 • edited Loading

CLAassistant commented Sep 19, 2023 •

edited

Loading