This repository contains analysis of IMDB data from multiple sources and analysis of movies/cast/box office revenues, movie brands and franchises.
Updated Jun 1, 2020 - TSQL
Reedelk Runtime Platform Community Edition
Hospital Database Management System (DBMS) is a comprehensive SQL project designed to streamline and optimize the management of hospital operations. This project aims to provide an efficient and user-friendly solution for storing, retrieving, and manipulating various types of healthcare-related data.
AIDevs project files
Integrating multimodal data through heterogeneous ensembles
Uses RapidAPI to fetch IMDb data, filter it, and upload it into separate tables in a MySQL database in one click using Talend.
This repository documents a step-by-step approach to scraping data, integrating data from various sources, and analyzing the combined data. It also shows how APIs can be used for data engineering operations; in this project, the Foursquare API supplied the location data.
A project to enhance ontology matching accuracy using Large Language Models (LLMs) like S-BERT.
Describe the different entities that form a modern data ecosystem. Describe and differentiate between the role and responsibilities of Data Engineers, Data Scientists, Data Analysts, Business Analysts, and Business Intelligence Analysts. Explain what Data Engineering is. List the tasks that need to be performed in a typical data engineering life…
Farr, M. T., D. S. Green, K. E. Holekamp, and E. F. Zipkin. 2020. Integrating distance sampling and presence-only data to estimate species abundance. Ecology 00(00):e03204. 10.1002/ecy.3204
Project involves merging customer reviews from Fudgemart and FudgeFlix to create a unified data warehouse using Kimball's approach. Utilizing Power BI, it aims to extract actionable insights for Fudge Inc., guiding strategic decisions, product enhancements, and market expansion based on comprehensive business intelligence.
Hormone Therapy Decision Support System for Breast Cancer
Uses various predictor factors to forecast heart disease in patients, with logistic regression and k-nearest neighbors models that predict whether a patient has heart disease, followed by some basic visualizations.
Integrates data from "Orderline.csv" and "Product.csv" using Talend, filtering on price and performing inner and left joins to extract insights and support data warehouse integration with Microsoft SQL Server.
A lab for DataAnalytics | DataEngineering | AnalyticsEngineering | DataScience | DataVisualization | BusinessIntelligence
Shoply is a dynamic eCommerce platform built with Django 5, featuring secure payments via Stripe, responsive design with Bootstrap 5, and a custom Kaggle data pipeline for automated product management. Scalable and interactive, it highlights expertise in full-stack development and real-world data integration.
Data integration and other data related programs
MeshJoin-Streaming-ETL-Data-Warehouse integrates real-time transactional data with master data using the Mesh Join algorithm. It processes and enriches data, then loads it into a data warehouse for analysis, leveraging efficient ETL processes and OLAP-ready SQL queries.
This repository contains an SSIS package that splits employee data from the AdventureWorksDW2017 database into country-specific tables (United States, United Kingdom, Germany, and others). It demonstrates ETL processes using tools like Merge Join, Conditional Split, and OLE DB Destination for efficient data integration.
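Several entries above (for example, the "Orderline.csv" and "Product.csv" project) describe filtering on price and then combining tables with inner and left joins. The sketch below shows that pattern in plain Python. It is not taken from any listed repository: the column names, sample rows, the `MIN_PRICE` threshold, and the `join` helper are all hypothetical, and the helper assumes each key appears at most once in the right-hand table.

```python
# Hypothetical rows; the real Orderline.csv / Product.csv schemas are not given in the listing.
orderlines = [
    {"order_id": 1, "product_id": 10, "quantity": 2},
    {"order_id": 2, "product_id": 11, "quantity": 1},
    {"order_id": 3, "product_id": 12, "quantity": 5},
    {"order_id": 4, "product_id": 99, "quantity": 3},  # no matching product
]
products = [
    {"product_id": 10, "price": 5.0},
    {"product_id": 11, "price": 25.0},
    {"product_id": 12, "price": 40.0},
]

MIN_PRICE = 20.0  # assumed filter threshold, for illustration only


def join(left_rows, right_rows, key, how="inner"):
    """Join two lists of dicts on `key` (right-side keys assumed unique)."""
    index = {row[key]: row for row in right_rows}
    right_cols = [c for c in (right_rows[0] if right_rows else {}) if c != key]
    out = []
    for row in left_rows:
        match = index.get(row[key])
        if match is not None:
            # Matching rows are merged in both join types.
            out.append({**row, **match})
        elif how == "left":
            # A left join keeps unmatched left rows, padding right columns with None.
            out.append({**row, **{c: None for c in right_cols}})
    return out


# Filter step: keep only products at or above the price threshold.
expensive = [p for p in products if p["price"] >= MIN_PRICE]

inner_result = join(orderlines, expensive, "product_id", how="inner")
left_result = join(orderlines, expensive, "product_id", how="left")
```

The inner join keeps only order lines whose product survived the price filter, while the left join keeps every order line and leaves `price` empty where no product matched. That difference is what makes a left join useful for spotting unmatched records before loading a warehouse.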