Changelog

v0.11.13 (2024-06-05)

Fix

  • ocr: Another fix for #274 (2a7b401)

v0.11.12 (2024-06-05)

Fix

  • PYPI email verification error for new release (545d8ea)

v0.11.11 (2024-06-05)

Fix

v0.11.10 (2024-02-22)

Fix

  • OCRFormatter: Add same_line_ratio_threshold class variable (#272) (5b47b77)

v0.11.9 (2023-12-21)

Fix

  • ocr: Checking the abnormal post correction feature added (#264) (e94d65d)

Documentation

  • Remove openpecha.org docs (3b2e2f1)

v0.11.8 (2023-10-05)

Fix

  • ci: Pin python-semantic-release version (a10c298)
  • dependency: Make commit parser happy (6c6eae1)

v0.11.7 (2023-05-29)

Fix

  • github-storage: Set current checkout branch as default (e2d6318)

v0.11.6 (2023-05-17)

Fix

  • elan: Indent ANNOTATION_VALUE tag (946d45d)
  • elan: Put end_payload in the end (adb9848)

v0.11.5 (2023-05-03)

Fix

  • ocr: Handle case where pecha_id is None (8740f22)

v0.11.4 (2023-03-31)

Fix

  • utils: Download private pecha (5ce7113)
  • storage: Generate auth remote url (5172865)

v0.11.3 (2023-03-29)

Fix

  • core/pecha: Add base_names_list prop to pecha (58e7135)
  • core/pecha: Pecha publish method accepts branch (5c599ca)

v0.11.2 (2023-03-21)

Fix

  • core/pecha: Overwriting bases in metadata (f9033eb)

v0.11.1 (2023-03-21)

Fix

  • core/pecha: Make arg optional (67fd90b)

v0.11.0 (2023-03-17)

Feature

  • elan: Initial implementation of elan serializer (3f8cd41)

Fix

  • elan: Get media_url from TranscriptionTimeSpanLayer (d060818)
  • pecha: Create meta.bases only if it doesn't exist (38e8d7e)

v0.10.0 (2023-03-15)

Feature

  • otranscribe: Initial implementation for oTranscribe formatter (13e2c83)
  • Add TranscriptionTimeSpanLayer (096f8ff)
  • Add a Layer for transcription (454a304)
  • Add annotation for transcription (ee669a3)

Fix

  • otranscribe: Take directory as an input path (9e1e9b5)
  • core/pecha: Pass LayerEnum obj instead of str (bba5d1f)
  • otranscribe: Add test data for oTranscribe formatter (acd9a22)
  • Read_layers_file file, get_base_metadata, about (14200c8)

v0.9.25 (2023-03-14)

Fix

  • po-exporter: Test case has been updated (267b8b3)

v0.9.24 (2023-02-13)

Fix

  • core: Check existence before unlink (190147e)

v0.9.23 (2023-01-23)

Fix

  • google_vision: Fix rare case in Google Vision json output (#236) (90b5dd6)

v0.9.22 (2023-01-20)

Fix

  • baregit: Fix access to non existing layer (7ab4d0b)

v0.9.21 (2023-01-19)

Fix

  • rdf: Better RDF export for NorbuKetaka texts (b4e3013)

v0.9.20 (2023-01-05)

Fix

  • alignment: All test passed for alignment (eb56db6)
  • alignment: Updated from OpenpechaFs to OpenpechaGitRepo (2a6bcb7)
  • alignment: Pecha publish updated to upload the assets (3e5e032)
  • alignment: Fix sentence segmenter (a3b0977)
  • alignment: Changed back the update_alignment in tmx (e6a0bee)
  • alignment: All test passed (e42d9c5)
  • alignment: Updated other tests (c32dbf1)
  • alignment: Updated test and tmx (32c0c88)
  • alignment: Tmx alignment reformat (2c26bd7)
  • alignment: Updated create_opf of alignment (092b9ca)

v0.9.19 (2022-12-30)

Fix

  • hotfix: Option to not use blupdate (418aa36)

v0.9.18 (2022-12-21)

Fix

  • pecha: Add storage attr to OpenPechaGitRepo (186b307)

v0.9.17 (2022-12-17)

Fix

  • core: Create OpenPechaGitRepo (ebf9c19)

v0.9.16 (2022-12-12)

Fix

  • test-formatter: Updated formatter test to use remove duplicates option (be796bd)

v0.9.15 (2022-12-12)

Fix

  • tests: Tests pass, do not use option in other tests (411ef5b)
  • ocr: Option to remove duplicate symbols, #220 (a4c7fe0)
  • formatter-ocr: Checks if bbox overlaps more than 60% (c081ef3)
  • formatter-ocr: Added remove duplicates and overlap (cf11570)

v0.9.14 (2022-11-30)

Fix

  • Replace AnnType with LayerEnum (6a995fc)

v0.9.13 (2022-11-30)

Fix

  • Replace AnnType with LayerEnum (e0feb81)

v0.9.12 (2022-11-28)

Fix

  • Remove AnnType dependency which is redundant (fce9d7d)

v0.9.11 (2022-11-28)

Fix

Documentation

  • Switch to mkdocs-material insiders (e1c0a45)

v0.9.10 (2022-11-23)

Fix

  • hocr-formatter: Removed the update for empty volume bugs (9193669)
  • hocr-formatter: Unzip html.zip and read html (9791dce)

v0.9.9 (2022-11-22)

Fix

v0.9.8 (2022-11-07)

Fix

  • pecha: Handle box rotation (182c674)

v0.9.7 (2022-10-20)

Fix

  • test_cli: Remove full path assert in download_pecha (5bd01af)
  • cli: Downgrade click version and add cli test (a82f153)

v0.9.6 (2022-10-20)

Fix

  • Disable git rebase when downloading pecha (d1c87c6)

v0.9.5 (2022-10-14)

Fix

  • cli: Add batch download command (ef5a7d8)
  • cli: Download cmd now only download single pecha (5ee31c9)

v0.9.4 (2022-10-14)

Fix

  • pecha: Update is private with geo restriction availability (3e9b3c2)
  • pecha: Used storage object to get org name and token. update pecha about (66696f2)
  • google-vision: Return none and o for get_bboxes_for_page if page is empty (a6e01e8)
  • google-vision: Check first char for unicode category (3deaaa2)
  • ocr: Get bdrc_scan_id from data provider (85aa344)
  • catalog-manager: Updated create_opf to the new format and add data_provider (4f492ad)
  • test: Assert annotation object updated (3c44292)
  • ocr: Add pecha id default as none (a7cc7ad)
  • pecha: Update pecha about and add is private to pecha (54428a5)
  • pecha: Publish existing pecha with changes (99af8b0)
  • pecha: Publish asset in release with pecha (efad1ba)
  • storages: Add check geo_restriction in meta for private repo (09b1162)

v0.9.3 (2022-10-07)

Fix

  • utils: Add download release (1b93132)

v0.9.2 (2022-10-07)

Fix

  • formatter.ocr: Use openpecha.core module to create pecha and layers (35ad913)
  • ocr: Add return type for get_metadata (b739372)
  • test: Update test (b651531)
  • ocr: Create opf method return openpechafs object (57e611f)

v0.9.1 (2022-10-05)

Fix

  • hocr-parser: Fix documentation (b7b6e95)
  • hocr-parser: Add documentation and a bug fix (cc5ffd8)

v0.9.0 (2022-10-05)

Feature

  • hocr: Parser base code implemented (e7ccec4)

Fix

  • hocr-parser: Updated google vision expected output (956933d)
  • hocr-parser: Add language tag function (e6c9e8e)
  • hocr-parser: Add data_provider and formatter function for IA format hocr (cc9b37c)
  • hocr-parser: Added IA format hocr test and test case (932e6fc)
  • hocr-parser: Updated the hocr (ec2fae3)
  • hocr-parser: Added ocr directory to formatters (72e2bba)
  • hocr-parser: Added hocr test (654ef6c)
  • hocr_parser: Fixed hocr_parser (0c87ae8)

v0.8.33 (2022-09-30)

Fix

  • rdf: Fix software and batch keys (d6f558e)

v0.8.32 (2022-09-30)

Fix

  • rdf: Fix github URL in RDF export (d05d876)

v0.8.31 (2022-09-28)

Fix

  • rdf: Wrong volume number in export (2f87e37)

v0.8.30 (2022-09-05)

Fix

  • hot-fix: Fix infinite loop (5a4a84d)

v0.8.29 (2022-08-29)

Fix

  • hot-fix: Default value of vertices are added (ebe82ae)

v0.8.28 (2022-08-29)

Fix

  • hot-fix: Google ocr parser link has been passed as parameter (9b6bc6c)

v0.8.27 (2022-08-22)

Fix

  • github-utils: GitHub repo visibility update function renamed (0e08f19)

v0.8.26 (2022-08-18)

Fix

  • buda: Better API logging (74366e6)

v0.8.25 (2022-08-18)

Fix

  • google-ocr-formatter: Commit made to trigger GitHub Action to publish changes on pip (7f57883)

v0.8.24 (2022-08-17)

Fix

  • google-ocr-formatter: Test case added (c8f2f1f)
  • google-ocr-formatter: Copyright variable updated (1e247bf)
  • google-ocr-formatter: Code documented and test case updated (cb344d9)
  • google-ocr-formatter: Space insertion done using rule-based algorithm (a5c596a)
  • setup: Boto3 import added (0a39219)
  • tmx-create-opf: Base saving bug fixed (de88f86)

v0.8.23 (2022-08-17)

Fix

  • core-pecha: Find span info for all layers by default (973960a)
  • core-pecha: Skip loading unsupported layer (9e39086)

v0.8.22 (2022-08-12)

Fix

  • google-ocr: Add restrictedInchina, access, copyright, license to metadata (2055318)

v0.8.21 (2022-08-11)

Fix

  • metadata: Add under copyright license (ceb9e39)

v0.8.20 (2022-08-10)

Fix

  • google-ocr-formatter: Meta title and author bug fixed (45b5b94)
  • google-ocr-formatter: Language layer formatter test case added (972278b)
  • google-ocr-formatter: Language layer parser methods added (8f9ff0d)
  • google-ocr-formatter: The ocr confidence index is saved in meta (b3ccd73)

Documentation

  • Add initial pecha metadata eg. (f0a07b6)

v0.8.19 (2022-08-03)

Fix

  • metadata: Refactor initial metadata (125b0ab)

v0.8.18 (2022-08-03)

Fix

  • google-ocr: Empty page bug fixed (9352526)

Documentation

v0.8.17 (2022-07-26)

Fix

Documentation

  • Add tab navigation (b8dad4a)
  • gh-action: Add job to build and deploy docs (dc86ea6)

v0.8.16 (2022-07-22)

Fix

  • config: Github org url of openpecha data updated (61a7c2c)

v0.8.15 (2022-07-22)

Fix

  • blupdate: Vol keywords are replaced by base name and related refactors are made (1eb61f3)

v0.8.14 (2022-07-18)

Fix

  • ocr-postprocessing: The bounding polys are sorted before text extraction (3537a3f)

v0.8.13 (2022-06-13)

Fix

  • Set openpecha-data org name from env var only (e2e9052)

v0.8.12 (2022-05-30)

Fix

  • core: Remove empty dict as default value in OpenPecha class (e863543)

v0.8.11 (2022-05-25)

Fix

  • metadata: Add separate metadata classes for pecha types (904f926)
  • core: Replace source id with initial id (ca083be)
  • core: Add source and diplomatic id generator (24549dd)

v0.8.10 (2022-05-24)

Fix

  • core: Add copyright and license in metadata (1db55da)
  • core-metadata: Typos (d1b44f0)
  • core: Move metadata model into metadata.py module (9f6f1a3)

Documentation

  • metadata: Add copyright and license to metadata (b15ab56)

v0.8.9 (2022-05-24)

Fix

  • serializer: Apply index updated (be9a831)
  • serializer: Vol id variables are renamed as base_id and vol keyword in span are changed to base (1d6898d)

v0.8.8 (2022-05-18)

Fix

  • corpus: Create corpus directory (7ea0b5f)

v0.8.7 (2022-05-18)

Fix

  • core: Remove prints (0cacb3d)
  • core: Add metadata attr to OpenPecha class (8c5cb98)
  • core: Set base text metadata (5f34da2)

v0.8.6 (2022-05-17)

Fix

  • corpus: Refactor to download tokenized corpus (b68bef0)
  • corpus: Lazy downloading corpus catalog csv file (cf24270)

v0.8.5 (2022-05-12)

Fix

  • corpus-quality: Add statistics (6234b19)

v0.8.4 (2022-05-11)

Fix

Documentation

v0.8.3 (2022-05-02)

Fix

  • core: Apply blupdate every time base is updated (a899043)
  • core-layer: Update annotation (1553f41)
  • core-annotation: Add default and options spelling in durchen (e34c157)

v0.8.2 (2022-04-27)

Fix

  • corpus-download: Pecha base path (155feb0)

v0.8.1 (2022-04-21)

Fix

  • corpus-download: Add download progress bar with tqdm (c075662)

v0.8.0 (2022-04-01)

Feature

  • Create empty ebook opf (cfb180d)
  • Add opf_path option for export command (facd1a9)
  • Add assets to release (da7c279)
  • Adapt annotations for shifted base-text (98251de)
  • Serializer of footnotes (3f0d21a)
  • Use blupdate for fuzzy matching (877556c)

Fix

  • corpus-download: Skip downloaded pecha (f42e150)
  • corpus-download: Use authenticated requests session (f0fe570)
  • corpus: Get gh token for corpus download (94e3188)
  • download: Download suggestion implemented (5dffcd6)
  • download: Base text url updated (db1d18b)
  • doc: Documented download function in corpus module (f381a3f)
  • corpus: Download corpus module completed (8b9cc73)
  • Add missing init (3adacbc)
  • cli: Transifex imports in cli (de530cf)
  • Typos in cli (c8b9183)
  • Missing init file (b1ddca8)
  • OpenPechaFS path (b99a041)
  • Add cli to count non word and save in meta.yml (5ac97c5)
  • Twn non_word counter can be added (8825625)
  • core-pecha: Add pecha text quality attr in meta (e0a59cf)
  • core-pecha: Retrieve span info (0fb51b9)
  • core-pecha: Add update last modified date (6285f79)
  • core: Remove layer_name arg from set_layer (e88fe77)
  • core: Set base and layer with OpenPecha (bb2f6d4)
  • core: Layer can add, get and remove annotation from it (eed2086)
  • Pydantic version (525d980)
  • core: Add remaining annotation classes (e12b4e0)
  • core: Forbid extra parameter in Annotation init (1d0ccb0)
  • core: Add Citation annotation (bd1538d)
  • storage: Set remote url (23d4570)
  • github: Remove return in update pecha download (de1c237)
  • github: Setup repo auth when download (cf315b4)
  • github: Setup auth for download repo (a11c86e)
  • storages: Add github repo auth (5b8cf8e)
  • core: Set default output path to save work (9488419)
  • core: Reset default path to save work (0b11e67)
  • work: Opwork id can be searched using bdrc instance id (a97796d)
  • work: Add load work from id (d75d67b)
  • core: Move work in work sub-pkg (691f9ee)
  • core: Save and load work from yaml (1fba453)
  • core: Add Work model and test (366540d)
  • core: Get id for pecha, work, alignment and collection (aa88541)
  • work: Add work test (caf8b52)
  • github-storage: Rename publisher to storage and add and remove file (557f5d8)
  • publisher-github: Remove and get repo (550c486)
  • publishers-github: Get pecha description from about prop (93a341a)
  • publishers-github: Make path optional when remove pecha (6c004f0)
  • core-publisher: Add base publisher class (7746060)
  • hfml: Page annotation updated (a7eec5e)
  • alignment-transifex: Set transifex project repo url (26bae15)
  • alignment-tmx: Add relation to alignment.yml's segmentsource (1a00972)
  • alignment-tmx: Add relation to alignment.yml's segmentsource (db65e41)
  • alignment-tmx: Add relation to alignment.yml's segmentsource (b96adc3)
  • po: Po view look up done on relation (1d81623)
  • alignment: Add import alignment cli (dc8aff0)
  • alignment: Return alignment repo path (7d0bf1c)
  • alignment: Default branch is master (413acbe)
  • alignment: Return project id and alignment title (c54b82d)
  • alignment: Base in exporter (4ce9e99)
  • alignment: Load metadata (876b817)
  • alignment-tmx: Add tmx parsers and po parsers (e462628)
  • alignment-tmx: Add tmx parsers and po parsers (cb7703c)
  • test: Test data included separately (e951fca)
  • blupdate: typos in update span (8d089f5)
  • test: Po test data separated (3a92be9)
  • test: Po test data separated (461e0f1)
  • po: Language added while po export (8fcb59c)
  • alignment-tmx: Add create alignment from tmx (049d970)
  • po: Po exporter updated (e25d1db)
  • alignment-tmx: Add create alignment from tmx (ed3588c)
  • alignment-tmx: Add create alignment from tmx (8c2c592)
  • alignment: Get po view added (b832737)
  • po: KeyError bug fixed and functions documented (d7ead02)
  • test: Test code updated (dbbae52)
  • alignment-transifex: Add target language to the project (8a89e01)
  • po: Po view populated to po branch (226d36b)
  • bitext: Bitext exporter implemented (538c3e7)
  • alignment-transifex: Add TM to transifex project (7d861f0)
  • po: Po exporter implemented (26bd01d)
  • google-ocr: Flag added if meta required or not (1a80bc2)
  • google-ocr: Post processing for page is done to google ocred pages (62db212)
  • save_page: Starting of text in same vol updated properly (eb273e0)
  • proofreading: Span of sub text is update bug fixed (434638d)
  • docx-serializer: Font family included in styles (f794ca5)
  • serialisers: Unify serialize api (3faa1b5)
  • editor-serializer: Multiple tsawa and citation type supported (d65835d)
  • epub: Enum used to avoid hardcoded condition checking (306a9c0)
  • editor-formatter: Ann type supports (b8a5f5a)
  • proofreading: Subtext start update bug fixed (0fbeb4d)
  • docx-serializer: Docx serializer formatted as class (024f19a)
  • test: Pecha asset added (f97fa31)
  • test: Chapter and booknumber added (8d3a56c)
  • testcase: New test case added (fff431f)
  • epub-serializer: Multiple type of citation and tsawa annotation supported (c35d7cf)
  • testcase: Test case improved (97d3b4b)
  • hfml-serializer: Subtext serializer bug fix (01d31c2)
  • proofreading: Branch bug fixed (5702684)
  • cli: Also check for remote branch (224e170)
  • cli: Pass repo in branch evaluate (f3feebd)
  • proofreading: Branch option and sub text update included (af04c6f)
  • download-pecha: Set fallback branch (0286d53)
  • proofreading: Vol info return list of volume details (b8164a6)
  • proofreading: Method to assist proof reading editor added (d874f91)
  • epub-test: Test case updated according to new book title tag (d2c0d7f)
  • docx: Test added to docx serializer (97c1c87)
  • test-formatter: Pedurma testcase updated according to changes made in formatter (8de3f01)
  • hfml: Self.dump replace to dump_yaml from utils (b96eb87)
  • Pagination layer name typo (29b6ede)
  • docx: Docx serializer added (c26b0cd)
  • pedurma: Formatter and serializer test case updated (d965422)
  • utils: YAML loader and dumper changed to CSafeLoader and CSafeDumper (da8ff8b)
  • pedurma-formatter: Pagination annotation changed to hfml format (bb80bbd)
  • Raise exception for pecha doesn't exist (69aa954)
  • pedurma-serializer: Doc string added (b9613ae)
  • test: Merge conflict resolved (9bb1a6c)
  • test: Test case added for pedurma formatter and serializer (64d2b98)
  • pedurma: Formatter for preview text and serializer of diplomatic text completed (d7ee2f5)
  • core-pecha: Reset layers by reading components (031b8bb)
  • core.pecha: Add rest layers (7d25a4a)
  • pedurma-formatter: Integration completed (5691391)
  • test: Test for serializers are separated (0ed687a)
  • editor: Chapter serialize correctly in editor (cd2572a)
  • pedurmaFormatter: Pedurma note formatter and serializer added (c39863b)
  • serialize: Text span bug fix (29ae052)
  • hfml-formatter: Topic end span for last page changed (a1d36de)
  • serializer: Line annotation removed (e3af107)
  • epub-serialise: Skip embedding ibook specification if epub doesnt exist (07b4acc)
  • epub-serializer: Renaming approach changed (ec620f4)
  • test-serialize: Epub serializer updated (d117d86)
  • epub-serializer: Ibook specification for proper font embedding included (107cb69)
  • text-formatter: Text-formatter added (b78698f)
  • hfml-serializer: Page index changed to imgnum (d83a86d)
  • New release (e16260c)
  • epub-serializer: Alt option added for img tag of credit page (540c6fa)
  • serializer: Index layer passed as parameter in order to avoid multiple loading of it (61a53ae)
  • epub-serializer: Verse type annotation style changed (788ecab)
  • hfml-serializer: Extra line at the end of pages bug resolved (395ec95)
  • Ann start (aa0cc38)
  • Create single ann as a group (b78bcf0)
  • Specify upper and lower bound for deps (cb2a629)
  • epub-serialize: Added front page generator using meta data (6f48523)
  • editor-formatter: Skip grouping if layer is empty (bd651cd)
  • Exclude all ann attrs with value none (c2408e4)
  • google-ocr: Add imgnum to page ann (76ccc8d)
  • epub: Set default toc level if exist in serialized html (d475e15)
  • Create layer if doesn't exist (af3ab83)
  • verse-annotation: Isverse attribute of verse type annotation changed to is_verse (4173c1a)
  • test_serializer: Testcase added for editor serializer (2c4d653)
  • editor-serializer: P tag introduced to verse components (7d4800f)
  • Grouping root-text and find verse (a1a2484)
  • Editor parser span (9242571)
  • Add missing layers and improve test (f336aef)
  • Return output from editor serializer instead of saving (c6297c0)
  • editor-serializer: Footnote serialization enabled (53149e3)
  • Author css class (583d688)
  • editor-serializer: Added special serializer for editor (0de0dbf)
  • Update base and layers (96c95b8)
  • Add editor output parser (ef85a5f)
  • epub-serializer: Removed credit page layer n added credit page img tag after first author (0342a17)
  • epub-serializer: Removed credit page layer n added credit page img tag after first author (044e119)
  • Branch checkout in pecha download (d89c607)
  • Toc level variable changed (c5c07af)
  • Toc level variable changed (3734339)
  • Assets path (6a602b6)
  • epub-serializer: Disabled font rescaling (1b63245)
  • Add save_layer method (fd2b8a8)
  • Remove 'opecha' from pecha id (064abab)
  • epub-serializer: Margin given to sabche, citation and tsawa which are not inline (4cc8637)
  • Pecha has components list (base + layer names) (bb1d206)
  • epub-serializer: Auto generation of TOC added (1db097c)
  • epub-serializer: Footnote marker and reference serializer added (1446879)
  • Save pecha (9634fa9)
  • Read pecha id from metadata (d9f008e)
  • blupdate test (2875e86)
  • download pecha with specific branch (4f4c9b7)
  • epub-serializer: Replace regular_indented1 of para before chapter title to regular_indented1 (fc243a6)
  • tsadra-formatter: Removed is_cover and is_subtitle. created new annotation sub title (7f0967a)
  • Layer annotations are in dict (de625ff)
  • Re-release the 0.7.3 (337b3cc)
  • Create review branch for opf (0bd7911)
  • Return empty string instead of None for source meta access (b2390f9)
  • Make output_path optional (5b67259)
  • epub-serializer: Page break after chapter tag included (b481d59)
  • Config for default pechas path (b92c3b7)
  • Add pydantic in dependencies (16f6ed0)
  • Add create opf for ebook (efe8bf8)
  • tsadra-formatter: Yigchung formatter updated as corner case detected (870f198)
  • epub-serializer: Indentation adjustment bug fix (018b5de)
  • epub-serializer: Indentation bug fixed (ab405d5)
  • epub-serializer: Break within title bug are fixed by inserting break before title (fc3b8fa)
  • epub-serializer: Indentation bug fixed for verse component (819c292)
  • tsadra-parser: Credit page parser added (61c3648)
  • Make methods to module funcs (7fec0d6)
  • Save src volume id to base file in meta (50e2717)
  • Add needs_pecha option while pecha download (693086d)
  • Test for old annotations structure (f8ea280)
  • git-utils: Dpecha with main as default branch (9860836)
  • Missing encoding specification (051e2c2)
  • Update index layer (39f0422)
  • Blupdate testcase and add cli (18d5a0c)
  • serializer: Empty line serialize correctly (e906287)
  • Import bonltk only when in use (a7e5b48)
  • Import bonltk only when in use (3ecd475)
  • Return local path to downloaded pecha (e91fb4d)
  • Return local path of downloaded pecha (fae9e58)
  • hfml-serializer: Pagination needs to serialize last (468191b)
  • Small bugs (a1b1016)
  • pagewise: Pagewise obj created (2480e03)
  • hfml-formatter: Durchen parser added (a4bdd15)
  • serialize-base-module: Empty line serialize correctly (cb4ebca)
  • formatter-basemodule: Pecha without text formatted correctly (ff5dee2)
  • hfml-formatter: Empty line formatted correctly (5fcbe87)
  • epub serialize: Break after 500 syllables (8d93203)
  • catalog: Move catalog manager into catalog sub-pkg (fb6b084)
  • catalog: Download corpus cli (69b82e6)
  • catalog-storage: Get base of all pechas (878a31b)
  • github: Create_file to accept org and token (7bf1562)
  • Github helper funcs to work on any org (b3aff03)
  • formatter: Missing kwarg metadata (051ca1b)
  • catalog: Create orphan branch for each layer (4ff76f7)
  • formatter: Add source metadata (1b2bbe8)
  • catalog: Add hfml item (81a982a)
  • serializer: Disable line-number for epub (3ed677e)
  • github-utils: Create prerelease and return asset download url (2b84ee3)
  • epub-serializer: Return serialized epub path (7d3e8a9)
  • test: Skip the hfml_serializer test for a new release (25fba24)
  • serializer: Unified opfpath to opf_path (9646b34)
  • hotfix: Pecha_id as arguments (e288b1d)
  • Improve log message (338e661)
  • formatter: Fix sub topic nested list (f22dcd1)
  • cli: Import error (218c2bf)
  • hfml-serializer: Save hfml text in original filename (2969d0a)
  • hfml-formatter: Add vol_id to filename mapping to metadata (b67022a)
  • hfml-formatter: Archaic regex pattern (314088d)
  • formatter: Create Global2LocalId object for every vol (1c1f4cf)
  • serializer: Add serializer method hfml and typo (d80d71f)
  • cli: Not defined 'result' (9ac51cc)
  • epub-serializer: Hfml modify (97ddde2)
  • cli: Add hfml-serializer (8260fe9)
  • Extract pecha_id from opf_path (444de8b)
  • hfml-formatter: Change id to id_ in create_opf method (a5e2fa7)
  • epub-serializer: Remove redundant pecha_id in serialize method (0a95fd0)
  • Add serializer output_path (a62d71d)
  • Missing openpecha config (d3d3d87)
  • Missing openpecha config (f94839d)
  • Compatible of hfml formatter and serializer (#66) (1ca18ce)
  • Change id to id_ (62e8cb6)
  • Add output_path optional for format command (0c9dfba)
  • Bug in loading and indexing old_layers (8418916)
  • Formatting index layer (0f6b9c1)
  • Imports and dependencies (c90407e)
  • Remove openpecha cmd config (43f60a1)
  • Test data (c9dad88)
  • Add local_id (de76e5b)
  • Accessing layer component (5cb2f7f)
  • Remove id from annotation (4f8611d)
  • Vol_id is optional to get_base_layer (ca4a2b7)
  • Passing text_id from cli (4251f9b)
  • Get page number from image file name (e2517e3)
  • Info file + version bump (777195b)
  • Image number contains c + version bump (8d9f512)
  • Invalid page number (8dfdbd1)
  • Page no. with sides (a|b) (140bc32)
  • Page no. contains 'a' (2250110)
  • Bug in custom last_id (24a119d)
  • Update last_id after 5 works processed (8f6244b)
  • Ocr-formatter first page index + version bump (faa4928)
  • Ocr-formatter test (5413b67)
  • Page number (6c69f22)
  • Missing new line at end of batch.csv (2a48cc1)
  • Remove error log file at beginning of run (601d94a)
  • Cct return (d827a4a)
  • Parent dir doesn't exist (bc500b1)
  • Remove file error (7d626c4)
  • Line start with newline (850d3da)
  • Return correct cct (e8cb243)
  • Shifting n_chars bug (4e493f5)
  • Flags incorrect text id (ff4aa63)
  • Bug in shifted annotations (92352f2)

Documentation

  • core: How to add base and layer to openpecha (15f96f5)
  • Add base and layer to pecha (18a1571)
  • Add assert in add annotation in layer (5347fb4)
  • Fix docs src path (23a2a49)
  • Add managing layer docs (a8bb38a)
  • annotations: Fix correction schema and tutorial path (d0cd410)
  • annotations: Add examples and test (d45fd71)
  • annotation: Add json schema build script (c2a066d)
  • Add citation schema and example (abb8ad1)
  • Add index layer example (603fb00)
  • Add annotations types (4489e1e)
  • Add annotations page (910f8c3)
  • Setup mkdocs (e5c322e)
  • alignment-transifex: Improve documentation (353175a)
  • alignment-transifex: Add doc_strings (6a7fab5)

v0.7.83 (2022-04-01)

Fix

  • corpus-download: Skip downloaded pecha (f42e150)

v0.7.82 (2022-03-31)

Fix

  • corpus-download: Use authenticated requests session (f0fe570)
  • corpus: Get gh token for corpus download (94e3188)

v0.7.81 (2022-03-30)

Fix

  • download: Download suggestion implemented (5dffcd6)
  • download: Base text url updated (db1d18b)
  • doc: Documented download function in corpus module (f381a3f)
  • corpus: Download corpus module completed (8b9cc73)

v0.7.80 (2022-03-17)

Fix

v0.7.79 (2022-03-17)

Fix

v0.7.78 (2022-03-17)

Fix

v0.7.77 (2022-03-17)

Fix

  • OpenPechaFS path (b99a041)
  • Add cli to count non word and save in meta.yml (5ac97c5)
  • Twn non_word counter can be added (8825625)
  • core-pecha: Add pecha text quality attr in meta (e0a59cf)

v0.7.76 (2022-02-15)

Fix

  • core-pecha: Retrieve span info (0fb51b9)
  • core-pecha: Add update last modified date (6285f79)
  • core: Remove layer_name arg from set_layer (e88fe77)
  • core: Set base and layer with OpenPecha (bb2f6d4)
  • core: Layer can add, get and remove annotation from it (eed2086)

Documentation

  • core: How to add base and layer to openpecha (15f96f5)
  • Add base and layer to pecha (18a1571)
  • Add assert in add annotation in layer (5347fb4)
  • Fix docs src path (23a2a49)
  • Add managing layer docs (a8bb38a)

v0.7.75 (2021-12-20)

Fix

  • core: Add remaining annotation classes (e12b4e0)
  • core: Forbid extra parameter in Annotation init (1d0ccb0)
  • core: Add Citation annotation (bd1538d)

Documentation

  • annotations: Fix correction schema and tutorial path (d0cd410)
  • annotations: Add examples and test (d45fd71)
  • annotation: Add json schema build script (c2a066d)
  • Add citation schema and example (abb8ad1)
  • Add index layer example (603fb00)
  • Add annotations types (4489e1e)
  • Add annotations page (910f8c3)

v0.7.74 (2021-12-18)

Fix

v0.7.73 (2021-12-14)

Fix

  • storage: Set remote url (23d4570)

Documentation

v0.7.72 (2021-12-06)

Fix

  • github: Remove return in update pecha download (de1c237)

v0.7.71 (2021-12-06)

Fix

  • github: Setup repo auth when download (cf315b4)

v0.7.70 (2021-12-06)

Fix

  • github: Setup auth for download repo (a11c86e)

v0.7.69 (2021-12-06)

Fix

  • storages: Add github repo auth (5b8cf8e)

v0.7.68 (2021-11-26)

Fix

  • core: Set default output path to save work (9488419)

v0.7.67 (2021-11-26)

Fix

  • core: Reset default path to save work (0b11e67)

v0.7.66 (2021-11-26)

Fix

  • work: Opwork id can be searched using bdrc instance id (a97796d)

v0.7.65 (2021-11-26)

Fix

  • work: Add load work from id (d75d67b)

v0.7.64 (2021-11-25)

Fix

  • core: Move work in work sub-pkg (691f9ee)
  • core: Save and load work from yaml (1fba453)
  • core: Add Work model and test (366540d)
  • work: Add work test (caf8b52)

v0.7.63 (2021-11-25)

Fix

  • core: Get id for pecha, work, alignment and collection (aa88541)

v0.7.62 (2021-11-23)

Fix

  • github-storage: Rename publisher to storage and add and remove file (557f5d8)
  • publisher-github: Remove and get repo (550c486)
  • publishers-github: Get pecha description from about prop (93a341a)
  • publishers-github: Make path optional when remove pecha (6c004f0)
  • core-publisher: Add base publisher class (7746060)

v0.7.61 (2021-10-29)

Fix

  • hfml: Page annotation updated (a7eec5e)

v0.7.60 (2021-10-29)

Fix

  • test: Test data included separately (e951fca)

v0.7.59 (2021-10-28)

Fix

  • alignment-transifex: Set transifex project repo url (26bae15)
  • alignment-tmx: Add relation to alignment.yml's segmentsource (1a00972)
  • alignment-tmx: Add relation to alignment.yml's segmentsource (db65e41)
  • alignment-tmx: Add relation to alignment.yml's segmentsource (b96adc3)
  • po: Po view look up done on relation (1d81623)
  • alignment: Add import alignment cli (dc8aff0)
  • alignment: Return alignment repo path (7d0bf1c)
  • alignment: Default branch is master (413acbe)
  • alignment: Return project id and alignment title (c54b82d)
  • alignment: Base in exporter (4ce9e99)
  • alignment: Load metadata (876b817)
  • alignment-tmx: Add tmx parsers and po parsers (e462628)
  • alignment-tmx: Add tmx parsers and po parsers (cb7703c)
  • test: Po test data separated (3a92be9)
  • test: Po test data separated (461e0f1)
  • po: Language added while po export (8fcb59c)
  • alignment-tmx: Add create alignment from tmx (049d970)
  • po: Po exporter updated (e25d1db)
  • alignment-tmx: Add create alignment from tmx (ed3588c)
  • alignment-tmx: Add create alignment from tmx (8c2c592)
  • alignment: Get po view added (b832737)
  • po: KeyError bug fixed and functions documented (d7ead02)
  • test: Test code updated (dbbae52)
  • alignment-transifex: Add target language to the project (8a89e01)
  • po: Po view populated to po branch (226d36b)
  • bitext: Bitext exporter implemented (538c3e7)
  • alignment-transifex: Add TM to transifex project (7d861f0)
  • po: Po exporter implemented (26bd01d)

Documentation

  • alignment-transifex: Improve documentation (353175a)
  • alignment-transifex: Add doc_strings (6a7fab5)

v0.7.58 (2021-10-20)

Fix

  • blupdate: typos in update span (8d089f5)

v0.7.57 (2021-09-15)

Fix

  • test-formatter: Pedurma testcase updated according to changes made in formatter (8de3f01)
  • pedurma: Formatter and serializer test case updated (d965422)
  • pedurma-formatter: Pagination annotation changed to hfml format (bb80bbd)
  • pedurma-serializer: Doc string added (b9613ae)
  • test: Merge conflict resolved (9bb1a6c)
  • test: Test case added for pedurma formatter and serializer (64d2b98)
  • pedurma: Formatter for preview text and serializer of diplomatic text completed (d7ee2f5)
  • pedurma-formatter: Integration completed (5691391)
  • pedurmaFormatter: Pedurma note formatter and serializer added (c39863b)

v0.7.56 (2021-09-15)

Fix

  • hfml: Replace self.dump with dump_yaml from utils (b96eb87)

v0.7.55 (2021-09-15)

Fix

  • google-ocr: Flag added to indicate whether meta is required (1a80bc2)
  • google-ocr: Page post-processing applied to Google OCR'd pages (62db212)

v0.7.54 (2021-09-06)

Fix

  • save_page: Starting of text in same vol updated properly (eb273e0)

v0.7.53 (2021-08-31)

Fix

  • proofreading: Sub text span update bug fixed (434638d)
  • proofreading: Subtext start update bug fixed (0fbeb4d)

v0.7.52 (2021-08-27)

Fix

  • docx-serializer: Font family included in styles (f794ca5)

v0.7.51 (2021-08-27)

Fix

  • editor-serializer: Multiple tsawa and citation type supported (d65835d)

v0.7.50 (2021-08-27)

Fix

  • serialisers: Unify serialize api (3faa1b5)

v0.7.49 (2021-08-26)

Fix

  • epub: Enum used to avoid hardcoded condition checking (306a9c0)
  • editor-formatter: Ann type supports (b8a5f5a)
  • docx-serializer: Docx serializer formatted as class (024f19a)
  • test: Pecha asset added (f97fa31)
  • test: Chapter and booknumber added (8d3a56c)
  • testcase: New test case added (fff431f)
  • epub-serializer: Multiple type of citation and tsawa annotation supported (c35d7cf)
  • testcase: Test case improved (97d3b4b)

v0.7.48 (2021-08-24)

Fix

  • epub-test: Test case updated according to new book title tag (d2c0d7f)
  • docx: Test added to docx serializer (97c1c87)
  • docx: Docx serializer added (c26b0cd)

v0.7.47 (2021-08-17)

Fix

  • hfml-serializer: Subtext serializer bug fix (01d31c2)

v0.7.46 (2021-08-13)

Fix

  • proofreading: Branch bug fixed (5702684)

v0.7.45 (2021-08-13)

Fix

  • cli: Also check for remote branch (224e170)
  • cli: Pass repo in branch evaluate (f3feebd)

v0.7.44 (2021-08-13)

Fix

  • proofreading: Branch option and sub text update included (af04c6f)

v0.7.43 (2021-08-12)

Fix

  • download-pecha: Set fallback branch (0286d53)

v0.7.42 (2021-08-12)

Fix

  • proofreading: Vol info return list of volume details (b8164a6)

v0.7.41 (2021-08-12)

Fix

  • proofreading: Method to assist proof reading editor added (d874f91)

v0.7.40 (2021-06-22)

Fix

  • Pagination layer name typo (29b6ede)

v0.7.39 (2021-06-02)

Fix

  • utils: YAML loader and dumper changed to CSafeLoader and CSafeDumper (da8ff8b)

v0.7.38 (2021-05-28)

Fix

  • Raise exception when pecha doesn't exist (69aa954)

v0.7.37 (2021-05-25)

Fix

  • core-pecha: Reset layers by reading components (031b8bb)

v0.7.36 (2021-05-21)

Fix

  • core.pecha: Add rest layers (7d25a4a)

v0.7.35 (2021-05-20)

Fix

  • test: Test for serializers are separated (0ed687a)
  • editor: Chapter serialize correctly in editor (cd2572a)
  • epub-serializer: Alt option added for img tag of credit page (540c6fa)
  • epub-serializer: Verse type annotation style changed (788ecab)

v0.7.34 (2021-05-13)

Fix

  • serialize: Text span bug fix (29ae052)
  • hfml-formatter: Topic end span for last page changed (a1d36de)
  • text-formatter: Text-formatter added (b78698f)

v0.7.33 (2021-05-06)

Fix

  • serializer: Line annotation removed (e3af107)

v0.7.32 (2021-05-05)

Fix

  • epub-serialise: Skip embedding ibook specification if epub doesn't exist (07b4acc)
  • epub-serializer: Renaming approach changed (ec620f4)
  • test-serialize: Epub serializer updated (d117d86)
  • epub-serializer: Ibook specification for proper font embedding included (107cb69)

v0.7.31 (2021-04-30)

Fix

  • hfml-serializer: Page index changed to imgnum (d83a86d)

v0.7.30 (2021-04-27)

Fix

  • New release (e16260c)
  • serializer: Index layer passed as parameter in order to avoid multiple loading of it (61a53ae)
  • hfml-serializer: Extra line at the end of pages bug resolved (395ec95)

v0.7.29 (2021-04-20)

Fix

v0.7.28 (2021-04-20)

Fix

  • Create single ann as a group (b78bcf0)

v0.7.27 (2021-04-10)

Fix

  • Specify upper and lower bound for deps (cb2a629)

v0.7.26 (2021-04-09)

Fix

  • epub-serialize: Added front page generator using meta data (6f48523)
  • epub: Set default toc level if exist in serialized html (d475e15)

v0.7.25 (2021-04-08)

Fix

  • editor-formatter: Skip grouping if layer is empty (bd651cd)

v0.7.24 (2021-04-08)

Fix

  • Exclude all ann attrs with value None (c2408e4)

v0.7.23 (2021-04-07)

Fix

  • google-ocr: Add imgnum to page ann (76ccc8d)

v0.7.22 (2021-03-29)

Fix

  • Create layer if doesn't exist (af3ab83)

v0.7.21 (2021-03-26)

Fix

  • verse-annotation: Isverse attribute of verse type annotation changed to is_verse (4173c1a)

v0.7.20 (2021-03-26)

Fix

  • test_serializer: Testcase added for editor serializer (2c4d653)
  • editor-serializer: P tag introduced to verse components (7d4800f)
  • editor-serializer: Footnote serialization enabled (53149e3)

v0.7.19 (2021-03-26)

Fix

  • Grouping root-text and find verse (a1a2484)

v0.7.18 (2021-03-25)

Fix

v0.7.17 (2021-03-25)

Fix

  • Add missing layers and improve test (f336aef)

v0.7.16 (2021-03-24)

Fix

  • Return output from editor serializer instead of saving (c6297c0)
  • Author css class (583d688)
  • editor-serializer: Added special serializer for editor (0de0dbf)

v0.7.15 (2021-03-23)

Fix

  • Update base and layers (96c95b8)
  • Add editor output parser (ef85a5f)

v0.7.14 (2021-03-23)

Fix

  • epub-serializer: Removed credit page layer and added credit page img tag after first author (0342a17)
  • epub-serializer: Removed credit page layer and added credit page img tag after first author (044e119)
  • Toc level variable changed (c5c07af)
  • Toc level variable changed (3734339)

v0.7.13 (2021-03-17)

Fix

  • Branch checkout in pecha download (d89c607)

v0.7.12 (2021-03-16)

Fix

v0.7.11 (2021-03-16)

Fix

  • epub-serializer: Disabled font rescaling (1b63245)
  • epub-serializer: Margin given to sabche, citation and tsawa which are not inline (4cc8637)
  • epub-serializer: Auto generation of TOC added (1db097c)
  • epub-serializer: Footnote marker and reference serializer added (1446879)
  • epub-serializer: Replace regular_indented1 of para before chapter title to regular_indented1 (fc243a6)
  • tsadra-formatter: Removed is_cover and is_subtitle; created new sub title annotation (7f0967a)
  • epub-serializer: Page break after chapter tag included (b481d59)
  • tsadra-formatter: Yigchung formatter updated as corner case detected (870f198)
  • epub-serializer: Indentation adjustment bug fix (018b5de)
  • epub-serializer: Indentation bug fixed (ab405d5)
  • epub-serializer: Break-within-title bug fixed by inserting break before title (fc3b8fa)
  • epub-serializer: Indentation bug fixed for verse component (819c292)
  • tsadra-parser: Credit page parser added (61c3648)

v0.7.10 (2021-03-11)

Fix

v0.7.9 (2021-03-11)

Fix

  • Remove 'opecha' from pecha id (064abab)

v0.7.8 (2021-03-09)

Fix

  • Pecha has components list (base + layer names) (bb1d206)

v0.7.7 (2021-03-04)

Fix

v0.7.6 (2021-03-03)

Fix

  • Read pecha id from metadata (d9f008e)

v0.7.5 (2021-03-03)

Fix

  • blupdate test (2875e86)
  • Download pecha with specific branch (4f4c9b7)
  • Layer annotations are in dict (de625ff)

v0.7.4 (2021-03-02)

v0.7.3 (2021-03-01)

Fix

  • Create review branch for opf (0bd7911)

v0.7.2 (2021-03-01)

Fix

  • Return empty string instead of None for source meta access (b2390f9)

v0.7.1 (2021-03-01)

Fix

  • Make output_path optional (5b67259)

v0.7.0 (2021-03-01)

Feature

Fix

  • Config for default pechas path (b92c3b7)

v0.6.35 (2021-03-01)

Fix

  • Add pydantic in dependencies (16f6ed0)
  • Add create opf for ebook (efe8bf8)

v0.6.34 (2021-02-16)

Fix

  • Make methods to module funcs (7fec0d6)

v0.6.33 (2021-02-09)

Fix

  • Save src volume id to base file in meta (50e2717)

v0.6.32 (2021-02-03)

Fix

  • Add needs_pecha option while pecha download (693086d)

v0.6.31 (2021-01-21)

Fix

  • Test for old annotations structure (f8ea280)

v0.6.30 (2021-01-21)

Fix

  • git-utils: Pecha with main as default branch (9860836)

v0.6.29 (2021-01-20)

Fix

  • Missing encoding specification (051e2c2)

v0.6.28 (2021-01-19)

Fix

v0.6.27 (2021-01-18)

Fix

  • Blupdate testcase and add cli (18d5a0c)

v0.6.26 (2021-01-15)

Fix

  • serializer: Empty line serialize correctly (e906287)
  • Return local path of downloaded pecha (fae9e58)
  • hfml-serializer: Pagination needs to serialize last (468191b)
  • pagewise: Pagewise obj created (2480e03)
  • hfml-formatter: Durchen parser added (a4bdd15)
  • serialize-base-module: Empty line serialize correctly (cb4ebca)
  • formatter-basemodule: Pecha without text formatted correctly (ff5dee2)
  • hfml-formatter: Empty line formatted correctly (5fcbe87)

v0.6.23 (2021-01-15)

Fix

  • Return local path of downloaded pecha (fae9e58)

v0.6.22 (2021-01-12)

Fix

  • hfml-serializer: Pagination needs to serialize last (468191b)
  • pagewise: Pagewise obj created (2480e03)
  • hfml-formatter: Durchen parser added (a4bdd15)
  • serialize-base-module: Empty line serialize correctly (cb4ebca)
  • formatter-basemodule: Pecha without text formatted correctly (ff5dee2)
  • hfml-formatter: Empty line formatted correctly (5fcbe87)

v0.6.21 (2020-12-23)

Fix

v0.6.25 (2021-01-15)

Fix

  • Import bonltk only when in use (a7e5b48)

v0.6.24 (2021-01-15)

Fix

  • Import bonltk only when in use (3ecd475)
  • Return local path to downloaded pecha (e91fb4d)

v0.6.20 (2020-11-23)

Fix

  • Break after 500 syllables (8d93203)

v0.6.19 (2020-11-16)

Fix

  • Move catalog manager into catalog sub-pkg (fb6b084)
  • Download corpus cli (69b82e6)
  • Get base of all pechas (878a31b)

v0.6.18 (2020-10-23)

Fix

  • Create_file to accept org and token (7bf1562)

v0.6.17 (2020-10-23)

Fix

  • Github helper funcs to work on any org (b3aff03)

v0.6.16 (2020-10-21)

Fix

v0.6.15 (2020-10-15)

Fix

  • Disable line-number for epub (3ed677e)

v0.6.14 (2020-10-13)

Fix

  • Create prerelease and return asset download url (2b84ee3)

v0.6.13 (2020-10-08)

Fix

  • Return serialized epub path (7d3e8a9)

v0.6.12 (2020-10-08)

Fix

  • Skip the hfml_serializer test for a new release (25fba24)
  • Unified opfpath to opf_path (9646b34)
  • Pecha_id as arguments (e288b1d)
  • Improve log message (338e661)
  • Fix sub topic nested list (f22dcd1)
  • Import error (218c2bf)