Skip to content

Commit

Permalink
Perf: Pandas modin lib (#114)
Browse files Browse the repository at this point in the history
* wip

* perf: use modin instead of pandas

* poetry lock

* fix: merge
  • Loading branch information
polomarcus authored Feb 26, 2024
1 parent 14a20d1 commit 510da04
Show file tree
Hide file tree
Showing 13 changed files with 2,641 additions and 276 deletions.
3 changes: 3 additions & 0 deletions docker-compose.yml
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,7 @@ services:
context: ./
dockerfile: Dockerfile
entrypoint: ["poetry", "run", "pytest", "--cov-report", "term:skip-covered", "--cov=quotaclimat", "--cov=postgres", "test/"]
#entrypoint: ["sleep", "1200"] # use to debug the container if needed
environment:
ENV: docker
# CHANNEL: "fr3-idf"
Expand All @@ -16,6 +17,7 @@ services:
POSTGRES_PASSWORD: password
POSTGRES_HOST: postgres_db
POSTGRES_PORT: 5432
MODIN_ENGINE: dask #TODO test me
tty: true # colorize terminal
volumes:
- ./quotaclimat/:/app/quotaclimat/
Expand Down Expand Up @@ -142,6 +144,7 @@ services:
MEDIATREE_PASSWORD: /run/secrets/pwd_api
MEDIATREE_AUTH_URL: https://keywords.mediatree.fr/api/auth/token/
KEYWORDS_URL: https://keywords.mediatree.fr/api/subtitle/ # https://keywords.mediatree.fr/docs/#api-Subtitle-SubtitleList
MODIN_ENGINE: dask #TODO test me
volumes:
- ./quotaclimat/:/app/quotaclimat/
- ./postgres/:/app/postgres/
Expand Down
Loading

1 comment on commit 510da04

@github-actions
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Coverage

Coverage Report
FileStmtsMissCoverMissing
postgres
   insert_data.py46785%38–40, 59–61, 66
   insert_existing_data_example.py20385%25–27
postgres/schemas
   models.py711579%74–81, 91–92, 101–111
quotaclimat/data_analytics
   analytics_signataire_charte.py29290%1–67
   bilan.py1081080%2–372
   data_coverage.py34340%1–94
   exploration.py1251250%1–440
   sitemap_analytics.py1181180%1–343
quotaclimat/data_ingestion
   categorization_program_type.py110%1
   config_youtube.py110%1
   scaleway_db_backups.py34340%1–74
   scrap_chartejournalismeecologie_signataires.py50500%1–169
   scrap_sitemap.py1341787%27–28, 33–34, 66–71, 95–97, 138–140, 202, 223–228
   scrap_tv_program.py62620%1–149
   scrap_youtube.py1141140%1–238
quotaclimat/data_ingestion/ingest_db
   ingest_sitemap_in_db.py594131%21–42, 45–65, 69–80
quotaclimat/data_ingestion/scrap_html
   scrap_description_article.py36392%19–20, 32
quotaclimat/data_processing/mediatree
   api_import.py17710242%38–42, 47–50, 54–57, 63, 66–93, 99–114, 119–121, 146–153, 157–160, 164–170, 181–192, 195–199, 205, 231–232, 236, 240–259, 263–274
   config.py15287%7, 16
   detect_keywords.py108397%129–131
   utils.py642167%27–51, 54, 73–74
quotaclimat/data_processing/sitemap
   sitemap_processing.py412734%15–19, 23–25, 29–47, 51–58, 66–96, 101–103
quotaclimat/utils
   channels.py660%1–95
   climate_keywords.py220%3–35
   healthcheck_config.py291452%22–24, 27–38
   logger.py14379%22–24
   plotly_theme.py17170%1–56
   sentry.py10280%21–22
TOTAL157596139% 

Tests Skipped Failures Errors Time
44 0 💤 0 ❌ 0 🔥 50.376s ⏱️

Please sign in to comment.