Guia 4

R para finanzas
Guía de Ejercicios 4
Análisis exploratorio de datos
Profesor: Víctor Macías E.
En esta guía usaremos los datos de “World Development Indicators” (WDI) obtenidos en la Guía de Ejercicios
3.
library(readr)
library(dplyr)
library(janitor)
library(kableExtra)
library(skimr) # estadística descriptiva
wbData1 <- read_rds("datos/wbData_FIN1960_2020.rds")
Pregunta 1
Nombre a las variables SP.P OP.T OT L, N Y.GDP.P CAP.P P.CD, F D.AST.P RV T.GD.ZS y F B.AT M.T OT L.P 5
como population, gdppc, credit y atm respectivamente.
wbData2 <- wbData1 %>% rename(population= SP.POP.TOTL,

gdppc = NY.GDP.PCAP.PP.CD,
credit = FD.AST.PRVT.GD.ZS,
atm = FB.ATM.TOTL.P5)
Pregunta 2
Ordene los datos en orden ascendente primero por country y luego por year
1
wbData2 <- wbData2 %>% arrange(country, year)
Pregunta 3
Cambie los nombres de las regiones a español. Por ejemplo, cambie la región “Middle East & North Africa”
a “Medio Oriente y Norte de Africa”.
wbData2 <- wbData2 %>%

mutate(region = recode(region,
"East Asia & Pacific" = "Este Asiático y Pacífico",
"Europe & Central Asia" = "Europa y Asia Central" ,
"Latin America & Caribbean" = "América Latina y el Caribe",
"Middle East & North Africa" = "Medio Oriente y Norte de Africa",
"North America" = "Norteamérica",
"South Asia" = "Asia del Sur",
"Sub-Saharan Africa" = "Africa Subsahariana"))
Pregunta 4
¿Cuántos países tiene la base de datos?, ¿Cuántos países por región tiene la base de datos?
wbData3 <- wbData2 %>% filter(region != "Aggregates")
wbData3 %>% group_by(region) %>% distinct(country) %>%

count() %>% adorn_totals("row") %>%
kbl(booktabs = T,
col.names = c("Región", "Núm. países"),
linesep = '') %>%
kable_styling(latex_options = c("striped", "hold_position"))
Pregunta 5
Identifique los 5 países con mayor población el año 2019
wbData3 %>% filter(year == 2019) %>%

select(country, population) %>%
slice_max(population, n = 5) %>%
kbl(booktabs = T,
col.names = c("País", "Población"),
linesep = '') %>%
2
Pregunta 6
Identifique los 10 países con mayor crecimiento de su población entre los años 2018 y 2019.
wbData3 %>%
mutate(crecim_poblacional = round(((population -lag(population))/lag(population))*100, 1)) %>%
filter(year == 2019) %>%
select(country, crecim_poblacional) %>%
slice_max(crecim_poblacional, n = 10) %>%
kbl(booktabs = T,
col.names = c("País", "Crecimiento población ( %)"),
linesep = '') %>%
Pregunta 7
Identifique los 10 países con menor PIB per cápita el año 2019
wbData3 %>% filter(year == 2019) %>%

select(country, gdppc) %>%
slice_min(gdppc, n = 10) %>%
kbl(booktabs = T,
col.names = c("País", "PIB per cápita (dólares)"),
linesep = '',
digits = 0) %>%
Pregunta 8
Para cada una de las regiones del mundo, determine los valores mínimos y máximos del PIB per cápita el
año 2019. Además, identifique los países con los mínimos y máximos del PIB per cápita en cada una de las
regiones.
wbData3 %>% filter(year ==2019) %>% group_by(region) %>%

select(region, country, gdppc) %>%
slice_min(gdppc, n = 1) %>%
kbl(booktabs = T,
col.names = c("Región", "País", "PIB per cápita (dólares)"),
linesep = '',
digits = 0) %>%
wbData3 %>% filter(year ==2019) %>% group_by(region) %>%

select(region, country, gdppc) %>%
slice_max(gdppc, n = 1) %>%
kbl(booktabs = T,
col.names = c("Región", "País", "PIB per cápita (dólares)"),
3
linesep = '',
digits = 0) %>%
Pregunta 9
Para cada una de las regiones del mundo, presente una tabla con estadística descriptiva para las variables PIB
per cápita, crédito bancario al sector privado y número de ATMs para el año 2018. Nota: Se recomiendan
revisar las funciones del paquete skimr en https:// cran.r-project.org/ web/ packages/ skimr/ index.html.
# Forma 1
wbData3 %>%
select(gdppc, credit, atm) %>%
skim() %>%
yank("numeric") %>%
select(-c("complete_rate", "hist"))
# Forma 2
wbData3 %>%
select(gdppc, credit, atm) %>%
skim() %>%
yank("numeric") %>%
select(-c("complete_rate", "hist")) %>%
kbl(booktabs = T,
col.names = c("Variable", "NA's", "Media", "Desv. Est.",
"Mínimo", "p25", "p50", "p75", "Máximo"),
digits = 1) %>%
write_rds(wbData3, file = "datos/wbData3_finanzas.rds")

Guia 4

Cargado por

Información del documento

Descripción original:

Título original

Derechos de autor

Formatos disponibles

Compartir este documento

Compartir o incrustar documentos

Opciones para compartir

¿Le pareció útil este documento?

¿Este contenido es inapropiado?

Copyright:

Formatos disponibles

Guia 4

Cargado por

Copyright:

Formatos disponibles

R para finanzas

Profesor: Víctor Macías E.

wbData1 <- read_rds("datos/wbData_FIN1960_2020.rds")

wbData2 <- wbData1 %>% rename(population= SP.POP.TOTL,

wbData2 <- wbData2 %>%

wbData3 <- wbData2 %>% filter(region != "Aggregates")

wbData3 %>% group_by(region) %>% distinct(country) %>%

wbData3 %>% filter(year == 2019) %>%

wbData3 %>% filter(year == 2019) %>%

wbData3 %>% filter(year ==2019) %>% group_by(region) %>%

wbData3 %>% filter(year ==2019) %>% group_by(region) %>%

write_rds(wbData3, file = "datos/wbData3_finanzas.rds")

También podría gustarte