Skip to content
#

parquet-files

Here are 22 public repositories matching this topic...

This repository contains the NYC Taxi Data Engineering Pipeline project, which aims to build a comprehensive data engineering pipeline using NYC taxi data from the years 2022 and 2023. The pipeline involves extracting, transforming and loading (ETL) data into a Snowflake database, followed by creating a dashboard for visualisation.

  • Updated Jul 4, 2024
  • Python

Aplicação que captura mensagens de um grupo de Telegram e as armazena diariamente em arquivos, utilizando AWS S3 para armazenamento em nuvem. Em seguida, as mensagens são analisadas com foco em sentimento, menções a produtos da empresa e detecção de intenção de compra. O processamento é automatizado em batch usando funções Lambda da AWS.

  • Updated Aug 30, 2024
  • Python

Improve this page

Add a description, image, and links to the parquet-files topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the parquet-files topic, visit your repo's landing page and select "manage topics."

Learn more