Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

This project focuses on analyzing COVID-19 mortality data in Chile from 2020 to 2024. With at least 60,000 records, the dataset provides critical insights into the impact of the pandemic on the population.

NotificationsYou must be signed in to change notification settings

CamilaJaviera91/Chile-COVID-Mortality-2020-2024

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 

Repository files navigation

📃 Project Description

  • Objective:

    • AnalyzeCOVID-19 mortality data in Chile from 2020 to 2024, providing critical insights into the pandemic's impact on the population.
  • Project Division:

    • First Part: Work withGoogle Sheets for initial data exploration and analysis.
    • Second Part: UsePostgreSQL for advanced analysis and data storage.
  • Methodology:

    • Google Sheets:

      • UsePython for data processing and analysis.
      • Collaboratively explore data usingGoogle Sheets.
    • PostgreSQL:

      • Migrate data to aPostgreSQL database for more detailed analysis and advanced queries.
  • Key Objectives:

    • Data Cleaning and Organization: Prepare the mortality dataset for meaningful analysis.
    • Exploration of Trends and Patterns: Investigate mortality rates by region, age, and time.
    • Data Visualization: Create effective visualizations to communicate key findings.

📋 Google Sheetsgit chec

The project leveragesPython for data processing and analysis andGoogle Sheets for collaborative data exploration.

🎯 Key objectives

  • Cleaning andorganizing the mortality dataset for meaningful analysis.
  • Exploring trends and patterns, such as mortality rates by region, age, or time.
  • Visualizing data to communicate insights effectively.
  • Building reproducible workflows for dataanalysis andreporting.

🗄️ Data Dictionary

Variable NameDescriptionData TypeValues
1AÑOYear of the death dateNumberN/A
2FECHA_DEFDate of the death dateDateN/A
3SEXOGloss identifying biological sexNumber1: Male, 2: Female, 9: Other
4EDAD_TIPOUnit of age measurementNumber1: Years, 2: Months, 3: Days
5EDAD_CANTNumerical record of the patient's age at admissionNumberN/A
6COD_COMUNACode of the commune of residence of the deceased, according to the political-administrative division updated in 2019NumberN/A
7COMUNAGloss of the commune of residence of the deceased, according to the political-administrative division updated in 2019TextN/A
8NOMBRE_REGIONGloss of the region of residenceTextN/A
9DIAG1Basic cause of deathTextN/A
10CAPITULO_DIAG1ICD-10 chapter according to the cause of deathTextN/A
11GLOSA_CAPITULO_DIAG1Gloss of the ICD-10 chapter according to the cause of deathTextN/A
12CODIGO_GRUPO_DIAG1Group code of the cause of death according to ICD-10TextN/A
13GLOSA_GRUPO_DIAG1Gloss of the group code of the cause of death according to ICD-10TextN/A
14CODIGO_CATEGORIA_DIAG1Category code of the cause of death according to ICD-10TextN/A
15GLOSA_CATEGORIA_DIAG1Gloss code of the cause of death according to ICD-10TextN/A
16CODIGO_SUBCATEGORIA_DIAG1Category code of the cause of death according to ICD-10TextN/A
17GLOSA_SUBCATEGORIA_DIAG1Gloss of the category code of the cause of death according to ICD-10TextN/A
18DIAG2External cause of deathTextN/A
19CAPITULO_DIAG2ICD-10 chapter according to the cause of deathTextN/A
20GLOSA_CAPITULO_DIAG2Gloss of the ICD-10 chapter according to the cause of deathTextN/A
21CODIGO_GRUPO_DIAG2Group code of the cause of death according to ICD-10TextN/A
22GLOSA_GRUPO_DIAG2Gloss of the group code of the cause of death according to ICD-10TextN/A
23CODIGO_CATEGORIA_DIAG2Category code of the cause of death according to ICD-10TextN/A
24GLOSA_CATEGORIA_DIAG2Gloss of the category code of the cause of death according to ICD-10TextN/A
25CODIGO_SUBCATEGORIA_DIAG2Category code of the cause of death according to ICD-10TextN/A
26GLOSA_SUBCATEGORIA_DIAG2Gloss of the category code of the cause of death according to ICD-10TextN/A
27LUGAR_DEFUNCIONDescribes the place where the death occursTextN/A

✨🗄️✨ Clean Data Dictionary

Variable NameDescriptionData TypeValues
1AÑOYear of the death dateNumberN/A
2FECHA_DEFDate of the death dateDateN/A
3SEXO_NOMBREGloss identifying biological sexNumberN/A
4EDAD_CANTNumerical record of the patient's age at admissionNumberN/A
5COMUNAGloss of the commune of residence of the deceased, according to the political-administrative division updated in 2019TextN/A
6NOMBRE_REGIONGloss of the region of residenceTextN/A
7LUGAR_DEFUNCIONDescribes the place where the death occursTextN/A
8PERIODOPeriod of the death dateDateN/A
9RANGO_ETARIOAge range of the deceasedTextN/A

📊 Charts

Month Deaths By Gender

charts1

Yearly Deaths By Region

charts2

Yearly Deaths By Age Range

charts3

📝 Download PDF

Month Deaths By Gender

download1

Yearly Deaths By Region

download2

Yearly Deaths By Age Range

download3

🔗 Important Links


🗄️ Postgres

The project leveragesPython for data processing and analysis andPostgres for collaborative data exploration.

🚀 Installation & Setup

Prerequisites

Ensure you have the following installed:

  • PostgreSQL (Download)
  • Python 3.x (Download)
  • DBeaver (optional, for database visualization:Download)
  • Required Python Libraries:
  pip install pandas scikit-learn psycopg2 matplotlib seaborn numpy reportlab fpdf googleapiclient google

🗄️ Database Schema

Connection

  1. In this case, we are going to useDBeaver, so the first thing we need to do is create a new connection.
  • As shown in the picture, we need to configure the following settings:
    • Host: localhost
    • Database: postgres
    • Port: 5432
    • Username: postgres
    • Password:YOUR PASSWORD

connection1

  1. After setting the parameters, we should test the connection to ensure it works before applying the configuration.

connection2

📌 Schema Diagram

connection2

covid_data

- AÑO (INT) - Year of the death date | Number- FECHA_DEF (DATE) - Date of the death date - SEXO_NOMBRE (VARCHAR(10)) - Gloss identifying biological sex- EDAD_CANT (INT) - Numerical record of the patient's age at admission- COMUNA (VARCHAR(100)) - Gloss of the commune of residence of the deceased, according to the political-administrative division updated in 2019- NOMBRE_REGION (VARCHAR(100)) - Gloss of the region of residence- LUGAR_DEFUNCION (VARCHAR(100)) - Describes the place where the death occurs- PERIODO (VARCHAR(7)) - Period of the death date- RANGO_ETARIO (VARCHAR(20)) - Age range of the deceased

✍🏻 Insert Data from Terminal

  • Since the data we have is quite large, we need to insert it through the terminal to avoid issues when using the editor.
psql -U postgres -d mi_base_de_datos -h localhost -p 5432 -f /home/usuario/Chile-COVID-Mortality-2020-2024/postgres/tables/add_into_tables.sql

🛠️ Example of a Query

select cd."AÑO" as "year",cd."NOMBRE_REGION" as "region",count(cd."EDAD_CANT") as "quantity",sum(case when cd."SEXO_NOMBRE" = 'Mujer' then 1 else 0 end) as "Women",sum(case when cd."SEXO_NOMBRE" = 'Hombre' then 1 else 0 end) as "Men"from covid_chile.covid_data cdgroup by "year", "region"order by "year";

📊 Data Analysis

Yearly Deaths by Gender and Region (PDF table)

analysis1

Yearly Deaths by Gender (Graphic)

analysis2

Yearly Deaths by Region (Graphic)

analysis3


🎖️ Acknowledgments

The data used in this project, including COVID-19 mortality records in Chile from 2020 to 2024, is provided by the Chilean government’s open data platform:

acknowledgment1

Special thanks toPamela Suarez(deis@minsal.cl) for creating and curating the dataset. I acknowledge and appreciate her work in making this information publicly accessible.


Thanks to theMinisterio de Salud de Chile and related governmental agencies for making this data publicly available.

About

This project focuses on analyzing COVID-19 mortality data in Chile from 2020 to 2024. With at least 60,000 records, the dataset provides critical insights into the impact of the pandemic on the population.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages


[8]ページ先頭

©2009-2025 Movatter.jp