Documentos de Académico
Documentos de Profesional
Documentos de Cultura
SISTEMA CORPORATIVO
FACULTAD DE ARQUITECTURA E INGENIERÍA
Facilitador
Tomás F. Ramírez
Presentado por
Jonathan Rodríguez 1-18-0622
Pandas: es una librería que nos permite realizar el análisis y manipulación de los datos que
queremos utilizar. Gracias a esta libreria podemos utilizar los dataframe para poder guardar o
almacenar nuestros datos para que estos puedan ser analizados y limpiados.
Numpy: esta libreria es muy usada en la ciencia de datos y permite, al igual que la libreria
pandas, la manipulación de datos. Podemos utilizar esta libreria para trabajar con arreglos
multidimensionales de alta eficiencia. Además, proporciona potentes estructuras de datos para
realizar los cálculos matemáticos.
Matplotlib: con esta libreria podemos realizar gráficas en 2D dentro de Python, permitiendonos
representar listas y vectores gráficamente. Esta libreria nos ofrece tener una mejor comprensión
de los resultados tras realizar los análisis, puesto que muchas veces una imagen vale mas que
mil palabras y son mas faciles de interpretar.
Ejercicios
In [ ]:
import pandas as pd
data = pd.read_csv("titanic3.csv")
In [2]:
data.head()
Out[2]:
pclass survived name sex age sibsp parch ticket fare cabin embarked
Allen,
Miss.
0 1 1 female 29.0000 0 0 24160 211.3375 B5 S
Elisabeth
Walton
Allison,
Master. C22
1 1 1 male 0.9167 1 2 113781 151.5500 S
Hudson C26
Trevor
Allison,
Miss. C22
2 1 0 female 2.0000 1 2 113781 151.5500 S
Helen C26
Loraine
Allison,
Mr.
C22
3 1 0 Hudson male 30.0000 1 2 113781 151.5500 S
C26
Joshua
Creighton
1 of 7 2/11/2021, 12:27 AM
TareaLab - Jupyter Notebook http://localhost:8889/notebooks/Downloads/TareaLab.ipynb#
pclass survived name sex age sibsp parch ticket fare cabin embarked
Allison,
2- Desarrollar una función para aplicar a la columna sexo del dataset titanic3, donde sustituya
female por el valor 0 y male por el 1.
In [ ]:
data["sex"].replace(["female", "male"], [0, 1], inplace=True)
In [4]:
data.head()
Out[4]:
pclass survived name sex age sibsp parch ticket fare cabin embarked boat
Allen,
Miss.
0 1 1 0 29.0000 0 0 24160 211.3375 B5 S
Elisabeth
Walton
Allison,
Master. C22
1 1 1 1 0.9167 1 2 113781 151.5500 S
Hudson C26
Trevor
Allison,
Miss. C22
2 1 0 0 2.0000 1 2 113781 151.5500 S NaN
Helen C26
Loraine
Allison,
Mr.
C22
3 1 0 Hudson 1 30.0000 1 2 113781 151.5500 S NaN
C26
Joshua
Creighton
Allison,
Mrs.
Hudson J C22
4 1 0 0 25.0000 1 2 113781 151.5500 S NaN
C (Bessie C26
Waldo
Daniels)
In [25]:
data.duplicated()
Out[25]:
2 of 7 2/11/2021, 12:27 AM
TareaLab - Jupyter Notebook http://localhost:8889/notebooks/Downloads/TareaLab.ipynb#
0 False
1 False
2 False
3 False
In [26]:
data = data.drop_duplicates()
In [27]:
data.duplicated()
Out[27]: 0 False
1 False
2 False
3 False
4 False
...
1304 False
1305 False
1306 False
1307 False
1308 False
Length: 1309, dtype: bool
4- Remplazar Miss por Srta, Mr. por Sr., Mrs por Sra en la columna name.
In [5]:
data["name"] = data["name"].replace({"Miss":"Srta", "Mr":"Sr", "Mrs":"Sra"
In [6]:
data.head()
Out[6]:
pclass survived name sex age sibsp parch ticket fare cabin embarked boat
Allen,
Srta.
0 1 1 0 29.0000 0 0 24160 211.3375 B5 S
Elisabeth
Walton
Allison,
Master. C22
1 1 1 1 0.9167 1 2 113781 151.5500 S
Hudson C26
Trevor
Allison,
Srta. C22
2 1 0 0 2.0000 1 2 113781 151.5500 S NaN
Helen C26
Loraine
Allison,
Sr.
C22
3 1 0 Hudson 1 30.0000 1 2 113781 151.5500 S NaN
C26
Joshua
Creighton
3 of 7 2/11/2021, 12:27 AM
TareaLab - Jupyter Notebook http://localhost:8889/notebooks/Downloads/TareaLab.ipynb#
pclass survived name sex age sibsp parch ticket fare cabin embarked boat
Allison,
Srs.
Hudson J C22
4 1 0 0 25.0000 1 2 113781 151.5500 S NaN
C (Bessie C26
Waldo
In [7]:
data["body"] = data["body"].fillna(0)
In [8]:
data.head()
Out[8]:
pclass survived name sex age sibsp parch ticket fare cabin embarked boat
Allen,
Srta.
0 1 1 0 29.0000 0 0 24160 211.3375 B5 S
Elisabeth
Walton
Allison,
Master. C22
1 1 1 1 0.9167 1 2 113781 151.5500 S
Hudson C26
Trevor
Allison,
Srta. C22
2 1 0 0 2.0000 1 2 113781 151.5500 S NaN
Helen C26
Loraine
Allison,
Sr.
C22
3 1 0 Hudson 1 30.0000 1 2 113781 151.5500 S NaN
C26
Joshua
Creighton
Allison,
Srs.
Hudson J C22
4 1 0 0 25.0000 1 2 113781 151.5500 S NaN
C (Bessie C26
Waldo
Daniels)
In [9]:
promedio_edad = data["age"].mean()
In [10]:
promedio_edad
Out[10]: 29.8811345124283
4 of 7 2/11/2021, 12:27 AM
TareaLab - Jupyter Notebook http://localhost:8889/notebooks/Downloads/TareaLab.ipynb#
In [11]:
data["age"] = data["age"].fillna(promedio_edad)
In [12]:
data.head()
Out[12]:
pclass survived name sex age sibsp parch ticket fare cabin embarked boat
Allen,
Srta.
0 1 1 0 29.0000 0 0 24160 211.3375 B5 S
Elisabeth
Walton
Allison,
Master. C22
1 1 1 1 0.9167 1 2 113781 151.5500 S
Hudson C26
Trevor
Allison,
Srta. C22
2 1 0 0 2.0000 1 2 113781 151.5500 S NaN
Helen C26
Loraine
Allison,
Sr.
C22
3 1 0 Hudson 1 30.0000 1 2 113781 151.5500 S NaN
C26
Joshua
Creighton
Allison,
Srs.
Hudson J C22
4 1 0 0 25.0000 1 2 113781 151.5500 S NaN
C (Bessie C26
Waldo
Daniels)
In [13]:
data.tail()
Out[13]:
pclass survived name sex age sibsp parch ticket fare cabin embarked
Zabour,
1304 3 0 0 14.500000 1 0 2665 14.4542 NaN
Srta. Hileni
Zabour,
1305 3 0 Srta. 0 29.881135 1 0 2665 14.4542 NaN
Thamine
Zakarian,
1306 3 0 Sr. 1 26.500000 0 0 2656 7.2250 NaN
Mapriededer
Zakarian,
1307 3 0 1 27.000000 0 0 2670 7.2250 NaN
Sr. Ortin
Zimmerman,
1308 3 0 1 29.000000 0 0 315082 7.8750 NaN
Sr. Leo
5 of 7 2/11/2021, 12:27 AM
TareaLab - Jupyter Notebook http://localhost:8889/notebooks/Downloads/TareaLab.ipynb#
1)Alto
2)medio
3)Bajo
In [20]:
data.dtypes
In [23]:
mapping = {1:"Alto", 2:"Medio", 3:"Bajo"}
data["pclass"] = data["pclass"].replace(mapping)
In [24]:
data.head()
Out[24]:
pclass survived name sex age sibsp parch ticket fare cabin embarked boat
Allen,
Srta.
0 Alto 1 0 29.0000 0 0 24160 211.3375 B5 S
Elisabeth
Walton
Allison,
Master. C22
1 Alto 1 1 0.9167 1 2 113781 151.5500 S
Hudson C26
Trevor
Allison,
Srta. C22
2 Alto 0 0 2.0000 1 2 113781 151.5500 S NaN
Helen C26
Loraine
6 of 7 2/11/2021, 12:27 AM
TareaLab - Jupyter Notebook http://localhost:8889/notebooks/Downloads/TareaLab.ipynb#
pclass survived name sex age sibsp parch ticket fare cabin embarked boat
Allison,
Sr.
C22
3 Alto 0 Hudson 1 30.0000 1 2 113781 151.5500 S NaN
C26
Joshua
Creighton
Allison,
In [ ]:
7 of 7 2/11/2021, 12:27 AM