dse-interview

This dataset comes from the Give Me Some Credit Kaggle competition. The data contains loan applicant information collected by a US credit bureau. Each row represents a single loan application and the information gathered on the applicant at the time of application. This project loads the data into a database through ETL, run both as a scheduled batch job and through a web API. It then explores how the data looks in a Jupyter notebook (exported as Markdown), as prepared for a Data Science Engineer interview.


Welcome to Data Science Engineering Pages

The Data Science Engineer role sits between Data Scientist and Data Engineer. It therefore requires familiarity with data engineering topics such as ETL and data warehousing (DWH), as well as the ability to present a dataset's schema, shape, and distribution, along with some machine learning and deep learning (ML/DL).

Assuming the service runs on a platform, ingesting raw data into an RDBMS, together with its preprocessing, should be handled as a regular batch job in the scheduler.
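As a rough illustration of what one such batch run could look like, here is a minimal sketch using only the Python standard library. It is not the code from this repository: SQLite stands in for the target database, the table name `applications` and the functions `preprocess` and `run_batch` are hypothetical, and the three columns are an assumed subset of the competition's CSV header. The only preprocessing shown is mapping missing values (assumed to appear as `NA` or an empty string) to NULL.

```python
import csv
import io
import sqlite3


def preprocess(row):
    """Cast raw CSV strings to typed values, mapping 'NA' or '' to None."""
    def to_number(value, cast):
        return None if value in ("", "NA") else cast(value)

    # Assumed subset of columns; the real schema may contain more fields.
    return (
        to_number(row["SeriousDlqin2yrs"], int),
        to_number(row["age"], int),
        to_number(row["MonthlyIncome"], float),
    )


def run_batch(conn, csv_file):
    """One batch run: create the table if needed, then load every row."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS applications ("
        "SeriousDlqin2yrs INTEGER, age INTEGER, MonthlyIncome REAL)"
    )
    rows = [preprocess(r) for r in csv.DictReader(csv_file)]
    with conn:  # commits on success, rolls back if the insert fails
        conn.executemany("INSERT INTO applications VALUES (?, ?, ?)", rows)
    return len(rows)


# Made-up sample rows, used only to exercise the sketch.
RAW = "SeriousDlqin2yrs,age,MonthlyIncome\n1,45,9120\n0,40,NA\n"
conn = sqlite3.connect(":memory:")
loaded = run_batch(conn, io.StringIO(RAW))
```

In a real deployment the scheduler (cron or a workflow tool) would call something like `run_batch` on each new file, and the in-memory connection would be replaced by a connection to the actual database.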

This repository is my solution to the task described above, which I was given as an interview assignment for a Data Science Engineer position.

(Thanks to Paidy Co., Ltd for a good assignment)