Documentos de Académico
Documentos de Profesional
Documentos de Cultura
ipynb - Colaboratory
https://colab.research.google.com/drive/12SiAFFDJFCQUvPvl510oCrBfanUOUpzw?usp=sharing#scrollTo=wY_dQnJJwidq&printMode=true 1/5
4/12/2020 Limpieza_datos.ipynb - Colaboratory
100% Completed]
#Visualizar
Generate el tipo
report de dato
structure: 100%que se encuentra en cada columna 1/1 [00:08<00:00, 8.58s/it]
df_raw_headers.info()
Render HTML:
<class 100%
'pandas.core.frame.DataFrame'> 1/1 [00:01<00:00, 1.80s/it]
RangeIndex: 6999 entries, 0 to 6998
Data columns (total 17 columns):
Export
# report to file: 100% Non-Null Count Dtype
Column 1/1 [00:00<00:00, 16.03it/s]
--- ------ -------------- -----
0 id 6999 non-null object
1 ingresos 6069 non-null object
2 egresos 6999 non-null object
3 activos 6082 non-null object
4 pasivos 6090 non-null object
5 vol_trans 6999 non-null object
6 a_econ 6074 non-null object
7 edad 6999 non-null object
8 anos_afili 6079 non-null object
9 genero 6348 non-null object
10 estado_civil 6999 non-null object
11 ocupacion 6999 non-null object
12 nivel_estudio 6999 non-null object
13 tipo_vivienda 6999 non-null object
14 estrato 6078 non-null object
15 num_hijos 6999 non-null object
16 personas_cargo 6999 non-null object
dtypes: object(17)
memory usage: 929.7+ KB
df_raw_headers['genero'].value_counts(dropna=False)
df_final['genero'].value_counts()
F 2937
M 2591
NINGUNO 538
NO 1
Name: genero, dtype: int64
x=df_final['genero'].value_counts(dropna=False).keys()
y=df_final['genero'].value_counts(dropna=False).to_list()
x1=df_final['estado_civil'].value_counts(dropna=False).keys()
y1=df_final['estado_civil'].value_counts(dropna=False).to_list()
fig = plt.figure()
ax = fig.add_axes([0,0,1,1])
ax.bar(x,y)
ax.set_title('Datos de columna Genero')
plt.show()
https://colab.research.google.com/drive/12SiAFFDJFCQUvPvl510oCrBfanUOUpzw?usp=sharing#scrollTo=wY_dQnJJwidq&printMode=true 3/5
4/12/2020 Limpieza_datos.ipynb - Colaboratory
ax2.bar(x1,y1)
ax2.set_title('Datos de columna Estado Civil')
plt.show()
https://colab.research.google.com/drive/12SiAFFDJFCQUvPvl510oCrBfanUOUpzw?usp=sharing#scrollTo=wY_dQnJJwidq&printMode=true 4/5
4/12/2020 Limpieza_datos.ipynb - Colaboratory
https://colab.research.google.com/drive/12SiAFFDJFCQUvPvl510oCrBfanUOUpzw?usp=sharing#scrollTo=wY_dQnJJwidq&printMode=true 5/5