
Another clarification of the concept of degrees of freedom

Regarding the question asked in class about why there are (n - 1) degrees of freedom, let us look at it this way. Suppose we have a sample of 5 numbers whose mean was 4. Then their sum must have been 20. Think of the 5 possible numbers as placed in 5 boxes, with the running sum in a last column:

box 1  box 2  box 3  box 4  box 5 | sum

First question: what value could the first box take? Suppose it was 2.
  2                               |   2

The second box could receive any other number, for example 7.
  2      7                        |   9

The third could be any other number, for example 4.
  2      7      4                 |  13

The fourth can be anything, say 0.
  2      7      4      0          |  13

The last one is now constrained by the sum, so it can only be 7.
  2      7      4      0      7   |  20

So, in general, we were free to choose the first 4 numbers, but the last one was already constrained. One could say that with 5 numbers we have 4 degrees of freedom, or n - 1 once the mean of a sample of size n has been fixed.
In general, the degrees of freedom in a sample of size n are the sample size minus the number of parameters (p) estimated from the data in order to obtain a statistic, that is, n - p.
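The box argument can be reproduced in a few lines of R. This is only a minimal sketch; the variable names (n, media, libres, ultimo, muestra) are chosen here for illustration and do not come from the class notes.

```r
# Sample size and the mean that the sample must have
n <- 5
media <- 4

# The first n - 1 values can be chosen freely (the values used in the boxes)
libres <- c(2, 7, 4, 0)

# The last value is forced by the constraint sum = n * mean
ultimo <- n * media - sum(libres)
ultimo

# Check: the completed sample has the required mean
muestra <- c(libres, ultimo)
mean(muestra)
```

Changing any of the four free values changes ultimo, but never leaves it free: it is always whatever is needed to reach the sum of 20.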

One observation: before taking the sample we do not know what value the mean will take, so to compute ȳ I need all n boxes. But once the sample has been taken, any other statistic that uses this estimated mean costs me one degree of freedom. The same idea extends to larger samples: for example, if a sample of size 20 has a given mean, then n - 1 = 19 degrees of freedom remain.
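One way to see the lost degree of freedom, sketched with the same five numbers: once the mean has been computed, the deviations from it must add up to zero, so only n - 1 of them can vary freely. The sample variance is a familiar statistic that uses the estimated mean, and R's var() divides by n - 1 for that reason. The names y, desv and s2 are illustrative.

```r
# The five numbers from the boxes
y <- c(2, 7, 4, 0, 7)
n <- length(y)

# Deviations around the sample mean always add up to zero,
# so knowing n - 1 of them fixes the last one
desv <- y - mean(y)
sum(desv)

# Sample variance computed by hand with n - 1 in the denominator
s2 <- sum(desv^2) / (n - 1)
s2

# It agrees with the built-in var()
all.equal(s2, var(y))
```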

A good summary of statistics

The psych package has some attractive functions that save scripting.

For example, the describe function produces a good summary table for reporting the statistics of a data set.
Example: consider the following soil data set for 100 plots:
reg Pppm pH Mg K MO Pend Riq asnm reg1
1 RE1 7.000 3.800 0.321 0.182 6.830 19 15 1744.7 1
2 RE1 19.040 3.700 0.379 0.241 5.080 22.2 20 1734.2 1
3 RE1 8.050 3.600 0.134 0.274 4.760 18.6 17 1726.8 1
4 RE1 5.320 4.100 0.117 0.157 3.940 21.5 24 1744.5 1
5 RE1 4.900 4.000 0.079 0.197 6.190 20.6 19 1739.6 1
6 RE1 2.100 4.000 0.042 0.137 3.750 26.4 20 1736.1 1
7 RE1 2.310 4.000 0.1 0.17 4.120 31 23 1745.1 1
8 RE1 1.400 3.700 0.102 0.179 2.750 26.3 26 1741 1
9 RE1 9.170 3.900 0.068 0.182 5.130 21.8 30 1755 1
10 RE1 4.620 4.300 0.037 0.163 5.140 27.4 20 1753.6 1
11 RE1 1.050 4.000 0.059 0.139 3.720 25.6 22 1743.9 1
12 RE1 7.980 3.500 0.08 0.128 5.480 23.5 30 1761.2 1
13 RE1 5.810 3.400 0.08 0.111 6.130 25.6 31 1761.2 1
14 RE1 5.250 3.600 0.07 0.147 3.650 22.2 27 1760.8 1
15 RE12 1.540 4.100 0.097 0.155 3.730 25.1 10 1693.4 2
16 RE12 10.150 3.900 0.074 0.158 4.160 30.9 15 1687.4 2
17 RE12 1.750 3.600 0.084 0.183 5.130 31.4 17 1680.4 2
18 RE12 2.660 3.900 0.076 0.144 3.680 23.9 9 1672.8 2
19 RE12 7.070 3.900 0.1 0.163 7.350 28.2 15 1683.2 2
20 RE12 2.660 3.800 0.134 0.162 6.420 23.5 14 1674.8 2
21 RE12 3.220 3.900 0.064 0.155 8.120 14.8 29 1679.3 2
22 RE12 2.590 4.000 0.094 0.161 6.460 28 30 1690.1 2
23 RE12 1.820 3.800 0.113 0.17 6.420 26.9 20 1685.9 2
24 RE12 2.870 3.900 0.074 0.152 3.480 25.3 13 1679.2 2
25 RE12 4.060 4.000 0.057 0.147 5.100 18.3 15 1678.6 2
26 RE12 5.110 4.000 0.088 0.146 4.240 38.7 15 1676.5 2
27 RE12 1.680 4.000 0.044 0.137 4.400 33.6 16 1678.4 2
28 RE12 2.450 3.800 0.079 0.134 4.780 25.7 16 1679.4 2
29 RE12 0.98 3.800 0.061 0.11 4.650 22.8 27 1678.2 2
30 RE12 8.260 3.600 0.073 0.109 4.820 20.3 20 1674.8 2
31 RE12 9.940 3.900 0.064 0.124 4.910 31.1 17 1664 2
32 RE12 4.410 3.800 0.046 0.103 3.580 22.8 20 1668.7 2
33 RE2 5.250 3.400 0.074 0.123 23.480 25.3 8 2975.8 3
34 RE2 2.520 3.400 0.255 0.028 25.320 27.2 7 2977 3
35 RE2 5.880 3.500 0.174 0.018 33.790 20.6 8 2981.2 3
36 RE2 7.280 3.800 0.232 0.057 28.100 9.8 4 2980.6 3
37 RE2 5.040 3.600 0.291 0.053 34.280 29.2 3 2978 3
38 RE2 6.020 3.500 0.14 0.201 37.130 21.6 2 2983.2 3
39 RE2 7.350 3.900 0.126 0.152 19.480 20.1 2 2981.5 3
40 RE2 0.98 3.600 0.05 0.199 34.150 24.4 7 2977 3
41 RE2 7.840 3.600 0.03 0.119 36.690 19.9 2 2976.6 3
42 RE2 2.520 3.600 0.029 0.126 30.250 13 2 2979.1 3
43 RE2 18.690 3.500 0.182 0.241 34.290 23.8 8 2965.2 3
44 RE2 5.040 3.600 0.032 0.186 17.450 23.8 10 2972.4 3
45 RE2 5.880 3.700 0.029 0.198 12.430 21.3 3 2976.5 3
46 RE2 2.520 3.600 0.025 0.152 24.370 11.1 2 2980.8 3
47 RE2 0.98 3.500 0.03 0.192 36.210 20.4 10 2963.7 3
48 RE2 5.110 3.500 0.035 0.162 29.370 22.4 1 2972.1 3
49 RE2 1.680 3.600 0.184 0.158 24.350 18 4 2977.6 3
50 RE2 2.030 3.600 0.184 0.165 21.130 26.3 8 2977.3 3
51 RE2 5.320 3.900 0.182 0.184 19.140 21.3 8 3053.2 3
52 RE2 2.170 3.500 0.181 0.152 35.350 24.4 11 3060.5 3
53 RE2 3.010 3.300 0.07 0.198 36.120 27 9 3069.6 3
54 RE2 7.980 3.700 0.07 0.072 20.080 32.1 10 3070.7 3
55 RE2 4.130 3.600 0.146 0.223 20.170 41.7 10 3074 3
56 RE2 2.310 3.500 0.071 0.495 18.230 24.4 9 3062 3
57 RE2 2.450 3.500 0.067 0.152 21.220 35.4 9 3079.9 3
58 RE2 4.060 3.900 0.139 0.186 13.540 22.9 9 3058 3
59 RE2 4.130 3.600 0.17 0.197 22.720 29.2 9 3066.2 3
60 RE2 9.520 3.600 0.145 0.153 21.340 30.9 13 3075.6 3
61 RE2 1.050 3.700 0.182 0.192 19.350 33.2 10 3060.4 3
62 RE2 1.890 3.700 0.184 0.166 23.460 35.7 11 3082.5 3
63 RE2 5.250 3.800 0.118 0.152 24.350 45.7 9 3095.7 3
64 RE2 1.960 3.900 0.118 0.184 22.160 31.5 6 3107.4 3
65 RE2 4.480 3.600 0.072 0.177 34.790 30.1 12 3066.6 3
66 RE2 6.160 3.800 0.118 0.169 23.420 41 14 3075.7 3
67 RE2 2.590 3.800 0.157 0.009 19.740 47 14 3089.9 3
68 RE2 6.720 4.200 0.025 0.121 10.360 35.9 8 3117.3 3
69 RE3 6.220 3.500 0.399 0.238 27.330 35.3 14 2035.6 4
70 RE3 5.730 3.800 0.368 0.273 28.950 20.7 16 2044.5 4
71 RE3 13.180 3.700 0.3 0.193 28.090 28.9 16 2049.1 4
72 RE3 5.100 5.200 2.176 0.263 17.870 43.4 14 2065.1 4
73 RE3 12.110 3.700 0.265 0.189 23.690 29.1 8 2044.7 4
74 RE3 18.760 6.200 2.397 0.202 21.780 32.3 12 2056.7 4
75 RE3 7.630 4.400 1.124 0.251 27.330 38.5 7 2067.8 4
76 RE3 14.000 4.400 0.957 0.351 21.500 38.4 8 2078.9 4
77 RE3 19.730 5.100 0.687 0.195 18.950 39.8 17 2055.2 4
78 RE3 18.130 4.600 0.637 0.523 20.090 36.1 7 2069.1 4
79 RE3 14.770 6.400 5.435 0.415 23.790 41.3 20 2019 4
80 RE3 19.260 5.000 2.097 0.271 24.350 44.8 26 2035.5 4
81 RE3 17.620 4.200 0.245 0.263 22.770 43.4 18 2053.2 4
82 RE3 14.140 4.800 0.49 0.259 19.740 38.6 23 2066.5 4
83 RE3 20.010 6.200 5.316 0.162 21.520 46.2 18 2012.7 4
84 RE3 12.430 5.300 2.378 0.191 23.340 42.9 17 2045.2 4
85 RE31 0.91 4.250 0.286 0.199 5.694 28.1 11.9 2012.3 5
86 RE31 0.84 4.730 0.88 0.23 5.071 36.7 13.9 2014.2 5
87 RE31 0.7 5.460 3.660 0.133 5.783 38.8 23.4 2012.4 5
88 RE31 0.7 4.240 0.146 0.153 7.829 36.5 8.1 2023.7 5
89 RE31 1.260 4.370 0.115 0.216 5.961 35.8 12 2025.5 5
90 RE31 0.84 4.920 1.217 0.442 5.605 40.4 19.4 2023.8 5
91 RE31 0.56 4.080 0.201 0.492 9.520 35.6 18.4 2036.8 5
92 RE31 0.56 4.510 0.175 0.133 10.765 38.4 11.9 2039.3 5
93 RE31 1.050 4.710 0.158 0.165 10.854 29.3 13.8 2041.3 5
94 RE31 1.120 4.210 0.409 0.244 5.694 34.1 11.3 2042.8 5
95 RE31 1.050 4.150 0.21 0.168 5.783 35.6 8.5 2048 5
96 RE31 0.91 4.370 0.177 0.156 6.940 32.3 9.7 2051.3 5
97 RE31 0.49 4.240 0.286 0.254 4.715 29.9 10 2054.5 5
98 RE31 0.56 4.280 0.296 0.279 5.516 37.4 16.3 2055.3 5
99 RE31 0.7 4.620 0.204 0.394 10.765 26.5 7.8 2063.9 5
100 RE31 0.7 4.270 0.286 0.345 11.744 29.8 10.7 2067.8 5

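# Read the table copied to the clipboard (this form of "clipboard" works on Windows).
# The header line has one field fewer than the data lines, so R takes it as the header
# and uses the first column as row names. stringsAsFactors=TRUE keeps reg as a factor.
suelos<-read.table("clipboard", stringsAsFactors=TRUE)
attach(suelos)  # makes the columns available by name (pH, Mg, reg, ...)
names(suelos)
[1] "reg" "Pppm" "pH" "Mg" "K" "MO" "Pend" "Riq" "asnm" "reg1"
str(suelos)
'data.frame': 100 obs. of 10 variables: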
$ reg : Factor w/ 5 levels "RE1","RE12","RE2",..: 1 1 1 1 1 1 1 1 1 1 ...
$ Pppm: num 7 19.04 8.05 5.32 4.9 ...
$ pH : num 3.8 3.7 3.6 4.1 4 4 4 3.7 3.9 4.3 ...
$ Mg : num 0.321 0.379 0.134 0.117 0.079 0.042 0.1 0.102 0.068 0.037 ...
$ K : num 0.182 0.241 0.274 0.157 0.197 0.137 0.17 0.179 0.182 0.163 ...
$ MO : num 6.83 5.08 4.76 3.94 6.19 3.75 4.12 2.75 5.13 5.14 ...
$ Pend: num 19 22.2 18.6 21.5 20.6 26.4 31 26.3 21.8 27.4 ...
$ Riq : num 15 20 17 24 19 20 23 26 30 20 ...
$ asnm: num 1745 1734 1727 1744 1740 ...
$ reg1: int 1 1 1 1 1 1 1 1 1 1 ...

library(psych)

describe(suelos[2:9])  # tabular summary of the main statistics; [2:9] drops the first
                       # variable, which is categorical, and the last one, which is not needed here

vars n mean sd median trimmed mad min max range


Pppm 1 100 5.57 5.20 4.27 4.62 4.05 0.49 20.01 19.52
pH 2 100 4.02 0.60 3.90 3.92 0.44 3.30 6.40 3.10
Mg 3 100 0.41 0.91 0.14 0.18 0.10 0.02 5.43 5.41
K 4 100 0.19 0.09 0.17 0.18 0.04 0.01 0.52 0.51
MO 5 100 15.55 10.66 12.98 14.58 12.24 2.75 37.13 34.38
Pend 6 100 29.16 8.19 28.15 28.94 8.82 9.80 47.00 37.20
Riq 7 100 13.62 7.17 12.50 13.21 6.67 1.00 31.00 30.00
asnm 8 100 2290.25 572.13 2048.55 2267.63 539.67 1664.00 3117.30 1453.30
skew kurtosis se
Pppm 1.38 1.07 0.52
pH 1.93 4.16 0.06
Mg 3.98 16.75 0.09
K 1.53 3.42 0.01
MO 0.43 -1.17 1.07
Pend 0.16 -0.57 0.82
Riq 0.51 -0.28 0.72
asnm 0.41 -1.63 57.21

The describeBy function (describe.by in older versions of psych) combines describe with by, and the by function allows even more detailed tables. For example, grouping the plots by pH < 4.

describeBy(suelos[2:9],suelos$pH <4)  # forms two groups: TRUE for pH < 4 and FALSE for pH >= 4


group: FALSE
vars n mean sd median trimmed mad min max range skew
Pppm 1 40 5.91 6.80 2.21 4.89 2.44 0.49 20.01 19.52 1.02
pH 2 40 4.55 0.63 4.29 4.43 0.43 4.00 6.40 2.40 1.50
Mg 3 40 0.84 1.34 0.23 0.52 0.25 0.02 5.43 5.41 2.21
K 4 40 0.23 0.10 0.20 0.21 0.08 0.12 0.52 0.40 1.28
MO 5 40 11.06 7.62 6.70 10.26 4.28 3.72 27.33 23.61 0.74
Pend 6 40 34.08 6.88 35.70 34.37 6.97 18.30 46.20 27.90 -0.37
Riq 7 40 15.40 5.86 15.00 15.09 6.97 7.00 30.00 23.00 0.38
asnm 8 40 1980.87 237.22 2036.15 1974.32 35.29 1676.50 3117.30 1440.80 2.37
kurtosis se
Pppm -0.59 1.07
pH 1.52 0.10
Mg 4.24 0.21
K 0.74 0.02
MO -1.11 1.20
Pend -0.68 1.09
Riq -0.73 0.93
asnm 10.63 37.51
------------------------------------------------------------
group: TRUE
vars n mean sd median trimmed mad min max range
Pppm 1 60 5.34 3.84 5.04 4.79 3.74 0.98 19.04 18.06
pH 2 60 3.67 0.16 3.65 3.68 0.22 3.30 3.90 0.60
Mg 3 60 0.13 0.09 0.11 0.12 0.06 0.02 0.40 0.37
K 4 60 0.16 0.07 0.16 0.16 0.04 0.01 0.50 0.49
MO 5 60 18.55 11.37 20.12 18.23 18.37 2.75 37.13 34.38
Pend 6 60 25.88 7.35 24.40 25.43 5.78 9.80 47.00 37.20
Riq 7 60 12.43 7.74 10.00 11.69 6.67 1.00 31.00 30.00
asnm 8 60 2496.51 635.53 2976.15 2525.54 150.71 1664.00 3107.40 1443.40
skew kurtosis se
Pppm 1.56 2.98 0.50
pH -0.09 -0.99 0.02
Mg 1.21 0.82 0.01
K 1.36 7.13 0.01
MO 0.03 -1.40 1.47
Pend 0.64 0.88 0.95
Riq 0.76 -0.06 1.00
asnm -0.36 -1.84 82.05
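How a logical condition splits the data into two groups can be sketched in base R with tapply. This is a small illustration using only the first six plots of the data table; the names pH6, Pppm6, grupo and medias are made up for the example, and tapply is a swapped-in base function, not what describeBy calls internally.

```r
# pH and Pppm for the first six plots of the data table
pH6   <- c(3.8, 3.7, 3.6, 4.1, 4.0, 4.0)
Pppm6 <- c(7.00, 19.04, 8.05, 5.32, 4.90, 2.10)

# Logical grouping variable: TRUE when pH < 4, FALSE otherwise
grupo <- pH6 < 4
table(grupo)   # plots with pH exactly 4 fall in the FALSE group

# Mean of Pppm within each group
medias <- tapply(Pppm6, grupo, mean)
medias
```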

The second argument of this command can be a categorical variable, for example reg.

describeBy(datos,VAnominal)  # general form: summaries by a nominal variable (datos and VAnominal are placeholders)

describeBy(suelos[2:9],reg)
group: RE1
vars n mean sd median trimmed mad min max range skew
Pppm 1 14 6.00 4.54 5.29 5.33 4.05 1.05 19.04 17.99 1.48
pH 2 14 3.83 0.26 3.85 3.83 0.22 3.40 4.30 0.90 0.01
Mg 3 14 0.12 0.10 0.08 0.10 0.03 0.04 0.38 0.34 1.62
K 4 14 0.17 0.04 0.17 0.17 0.03 0.11 0.27 0.16 0.85
MO 5 14 4.76 1.16 4.92 4.76 1.59 2.75 6.83 4.08 0.10
Pend 6 14 23.69 3.50 22.85 23.51 4.08 18.60 31.00 12.40 0.34
Riq 7 14 23.14 5.05 22.50 23.17 5.19 15.00 31.00 16.00 0.15
asnm 8 14 1746.26 10.74 1744.60 1746.64 12.97 1726.80 1761.20 34.40 0.01
kurtosis se
Pppm 2.13 1.21
pH -1.16 0.07
Mg 1.18 0.03
K -0.04 0.01
MO -1.16 0.31
Pend -0.87 0.93
Riq -1.36 1.35
asnm -1.22 2.87
------------------------------------------------------------
group: RE12
vars n mean sd median trimmed mad min max range skew
Pppm 1 18 4.07 2.89 2.77 3.88 1.71 0.98 10.15 9.17 0.99
pH 2 18 3.87 0.13 3.90 3.88 0.15 3.60 4.10 0.50 -0.53
Mg 3 18 0.08 0.02 0.08 0.08 0.02 0.04 0.13 0.09 0.56
K 4 18 0.15 0.02 0.15 0.15 0.02 0.10 0.18 0.08 -0.46
MO 5 18 5.08 1.35 4.80 4.99 1.27 3.48 8.12 4.64 0.74
Pend 6 18 26.18 5.72 25.50 26.11 4.00 14.80 38.70 23.90 0.14
Riq 7 18 17.67 5.88 16.00 17.44 3.71 9.00 30.00 21.00 0.78
asnm 8 18 1679.17 7.20 1678.90 1679.23 6.08 1664.00 1693.40 29.40 0.00
kurtosis se
Pppm -0.47 0.68
pH -0.28 0.03
Mg -0.20 0.01
K -0.81 0.01
MO -0.60 0.32
Pend -0.33 1.35
Riq -0.34 1.39
asnm -0.32 1.70
------------------------------------------------------------
group: RE2
vars n mean sd median trimmed mad min max range skew
Pppm 1 36 4.66 3.29 4.30 4.29 2.70 0.98 18.69 17.71 2.10
pH 2 36 3.64 0.18 3.60 3.63 0.15 3.30 4.20 0.90 0.83
Mg 3 36 0.12 0.07 0.12 0.12 0.09 0.02 0.29 0.27 0.30
K 4 36 0.16 0.08 0.16 0.16 0.05 0.01 0.50 0.49 1.44
MO 5 36 25.22 7.50 23.47 25.38 6.64 10.36 37.13 26.77 0.11
Pend 6 36 26.88 8.77 24.85 26.64 6.82 9.80 47.00 37.20 0.38
Riq 7 36 7.56 3.65 8.00 7.53 2.97 1.00 14.00 13.00 -0.26
asnm 8 36 3026.13 51.97 3018.20 3023.98 63.90 2963.70 3117.30 153.60 0.15
kurtosis se
Pppm 6.68 0.55
pH 0.92 0.03
Mg -0.86 0.01
K 6.12 0.01
MO -1.08 1.25
Pend -0.18 1.46
Riq -0.97 0.61
asnm -1.76 8.66
------------------------------------------------------------
group: RE3
vars n mean sd median trimmed mad min max range skew
Pppm 1 16 13.68 5.19 14.07 13.84 6.49 5.10 20.01 14.91 -0.38
pH 2 16 4.78 0.93 4.70 4.76 0.82 3.50 6.40 2.90 0.34
Mg 3 16 1.58 1.68 0.82 1.40 0.80 0.24 5.43 5.19 1.29
K 4 16 0.26 0.09 0.26 0.25 0.08 0.16 0.52 0.36 1.36
MO 5 16 23.19 3.36 23.05 23.16 3.35 17.87 28.95 11.08 0.23
Pend 6 16 37.48 6.89 38.55 38.06 6.82 20.70 46.20 25.50 -0.84
Riq 7 16 15.06 5.63 16.00 14.86 4.45 7.00 26.00 19.00 0.07
asnm 8 16 2049.93 18.17 2051.15 2050.51 21.72 2012.70 2078.90 66.20 -0.44
kurtosis se
Pppm -1.36 1.30
pH -1.16 0.23
Mg 0.41 0.42
K 1.14 0.02
MO -1.21 0.84
Pend -0.15 1.72
Riq -0.97 1.41
asnm -0.71 4.54
------------------------------------------------------------
group: RE31
vars n mean sd median trimmed mad min max range skew
Pppm 1 16 0.81 0.23 0.77 0.80 0.26 0.49 1.26 0.77 0.36
pH 2 16 4.46 0.36 4.33 4.42 0.21 4.08 5.46 1.38 1.35
Mg 3 16 0.54 0.88 0.25 0.35 0.11 0.12 3.66 3.54 2.75
K 4 16 0.25 0.11 0.22 0.24 0.09 0.13 0.49 0.36 0.82
MO 5 16 7.39 2.46 5.87 7.27 1.39 4.71 11.74 7.03 0.61
Pend 6 16 34.08 4.22 35.60 34.16 4.45 26.50 40.40 13.90 -0.33
Riq 7 16 12.94 4.42 11.90 12.56 3.11 7.80 23.40 15.60 0.87
asnm 8 16 2038.31 18.05 2040.30 2038.06 22.09 2012.30 2067.80 55.50 -0.03
kurtosis se
Pppm -1.12 0.06
pH 1.21 0.09
Mg 6.79 0.22
K -0.68 0.03
MO -1.43 0.62
Pend -1.34 1.06
Riq -0.29 1.11
asnm -1.39 4.51
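Several columns of these tables can be checked by hand with base R. The sketch below uses the 14 Pppm values of region RE1 from the data table and assumes that describe reports a 10% trimmed mean and the scaled median absolute deviation returned by mad(); under those assumptions the rounded results agree with the Pppm row printed for group RE1. The names x and resumen are illustrative.

```r
# Pppm for the 14 plots of region RE1, copied from the data table
x <- c(7.00, 19.04, 8.05, 5.32, 4.90, 2.10, 2.31, 1.40,
       9.17, 4.62, 1.05, 7.98, 5.81, 5.25)
n <- length(x)

resumen <- c(
  n       = n,
  mean    = mean(x),
  sd      = sd(x),                # n - 1 in the denominator
  median  = median(x),
  trimmed = mean(x, trim = 0.1),  # drops 10% of the values at each end
  mad     = mad(x),               # median absolute deviation, scaled by 1.4826
  min     = min(x),
  max     = max(x),
  range   = max(x) - min(x),
  se      = sd(x) / sqrt(n)       # standard error of the mean
)
round(resumen, 2)
```

The se column is simply sd divided by the square root of n, which is why it shrinks as the groups get larger.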

Simple plots. Another way to describe data is visually, with box plots, histograms, scatter-plot matrices and correlations between pairs of variables (the pairs function draws a matrix of plots), and even histograms for all the variables.

boxplot(suelos[3:5],main="Box plot for pH, Mg, K")  # box plot for 3 numeric variables selected by some criterion

[Figure: "Box plot for pH, Mg, K"; one box each for pH, Mg and K on a common vertical axis from 0 to 6]

hist(pH)  # histogram of pH

[Figure: "Histogram of pH"; x axis pH from 3.0 to 6.5, y axis Frequency from 0 to 50]

plot(pH,MO,main="Scatter plot of pH and MO")  # scatter plot between two variables

[Figure: "Scatter plot of pH and MO"; x axis pH from 3.5 to 6.5, y axis MO from 5 to 35]

pairs(suelos[2:6], main="Pairwise scatter plots")  # scatter-plot matrix between pairs of variables

[Figure: "Pairwise scatter plots"; scatter-plot matrix for Pppm, pH, Mg, K and MO]

multi.hist(suelos[2:9],bcol="gray",dcol= c("blue","red"),dlty=c("dotted", "solid"))  # the options bcol, dcol, etc. can be set to taste

[Figure: eight panels, each titled "Histogram, Density, and Normal Fit", for Pppm, pH, Mg, K, MO, Pend, Riq and asnm; y axis Density]

pairs.panels(suelos[2:9],main="Multiple options for visualizing data")

[Figure: "Multiple options for visualizing data"; pairs.panels matrix for Pppm, pH, Mg, K, MO, Pend, Riq and asnm, with the variable names on the diagonal and the pairwise Pearson correlations in the upper panels]

Correlations and plots. These complement the pairs command with the Pearson correlations, rounded to 2 decimals:

round(cor(suelos[2:9],use="pairwise"),2)  # correlation matrix rounded to 2 decimals
Pppm pH Mg K MO Pend Riq asnm
Pppm 1.00 0.36 0.42 0.18 0.27 0.22 0.12 -0.08
pH 0.36 1.00 0.83 0.35 -0.15 0.55 0.21 -0.34
Mg 0.42 0.83 1.00 0.25 0.11 0.46 0.17 -0.14
K 0.18 0.35 0.25 1.00 -0.01 0.26 0.02 -0.14
MO 0.27 -0.15 0.11 -0.01 1.00 0.00 -0.56 0.75
Pend 0.22 0.55 0.46 0.26 0.00 1.00 0.12 -0.06
Riq 0.12 0.21 0.17 0.02 -0.56 0.12 1.00 -0.68
asnm -0.08 -0.34 -0.14 -0.14 0.75 -0.06 -0.68 1.00
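What cor() computes for each pair of variables can be sketched by hand. The example uses only the pH and Mg values of the first six plots, so its result is not the 0.83 reported for the full 100 plots; the names pH6, Mg6, dx, dy and r_manual are illustrative.

```r
# pH and Mg for the first six plots of the data table
pH6 <- c(3.8, 3.7, 3.6, 4.1, 4.0, 4.0)
Mg6 <- c(0.321, 0.379, 0.134, 0.117, 0.079, 0.042)

# Deviations of each variable from its own mean
dx <- pH6 - mean(pH6)
dy <- Mg6 - mean(Mg6)

# Pearson correlation: sum of cross-products over the root of the
# product of the sums of squares
r_manual <- sum(dx * dy) / sqrt(sum(dx^2) * sum(dy^2))
r_manual

# Same value as the built-in function
all.equal(r_manual, cor(pH6, Mg6))
```

The argument use="pairwise" only matters when there are missing values: each correlation is then computed from the rows that are complete for that particular pair.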

Convert all the values in the data set to z-scores.

zsuelos=data.frame(scale(suelos[,2:9]))  # center each variable on its mean and divide by its standard deviation
head(zsuelos)
Pppm pH Mg K MO Pend Riq asnm
1 0.27529427 -0.3727131 -0.10165732 -0.08547404 -0.8185157 -1.2395522 0.1924357 -0.9535506
2 2.58959601 -0.5397742 -0.03814887 0.56607227 -0.9827370 -0.8489498 0.8901722 -0.9719031
3 0.47712291 -0.7068352 -0.30641731 0.93049647 -1.0127660 -1.2883775 0.4715303 -0.9848373
4 -0.04763156 0.1284699 -0.32503186 -0.36155299 -1.0897155 -0.9343941 1.4483613 -0.9539001
5 -0.12836302 -0.0385911 -0.36664084 0.08017333 -0.8785738 -1.0442510 0.7506249 -0.9624646
6 -0.66657272 -0.0385911 -0.40715485 -0.58241614 -1.1075452 -0.3362842 0.8901722 -0.9685822
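The transformation applied by scale() can be sketched by hand: subtract the mean and divide by the standard deviation (computed with n - 1). The example uses only the first six Pppm values, so its z-scores differ from the ones printed for the full data set, where the mean and standard deviation come from all 100 plots. The names x, z_manual and z_scale are illustrative.

```r
# Pppm for the first six plots of the data table
x <- c(7.00, 19.04, 8.05, 5.32, 4.90, 2.10)

# z-score by hand
z_manual <- (x - mean(x)) / sd(x)

# scale() returns a one-column matrix; as.numeric() drops the attributes
z_scale <- as.numeric(scale(x))

all.equal(z_manual, z_scale)

# After standardizing, the mean is 0 and the standard deviation is 1
mean(z_manual)
sd(z_manual)
```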

Interactions.

Consider the interaction of categorical variables with continuous variables, and other general aspects, in summaries of smaller data sets, chosen subsets, etc. For example, look at what happens with pH and Mg.

Example: let the dependent variable be Mg, the independent variable pH, and the categorical independent variable the region. We can build a new data frame with them:

suelos1=data.frame(Mg,pH,reg)  # data frame with the 3 variables

str(suelos1)
'data.frame': 100 obs. of 3 variables:
$ Mg : num 0.321 0.379 0.134 0.117 0.079 0.042 0.1 0.102 0.068 0.037 ...
$ pH : num 3.8 3.7 3.6 4.1 4 4 4 3.7 3.9 4.3 ...
$ reg: Factor w/ 5 levels "RE1","RE12","RE2",..: 1 1 1 1 1 1 1 1 1 1 ...

head(suelos1)
Mg pH reg
1 0.321 3.80 RE1
2 0.379 3.70 RE1
3 0.134 3.60 RE1
4 0.117 4.10 RE1
5 0.079 4.00 RE1

Zsuelos1=data.frame(scale(suelos1[1:2]))  # convert the continuous variables of suelos1 to z-scores
# Standardizing before the regression analysis makes the interaction term less
# correlated with the main effects, which may be useful depending on the analysis.
# Zsuelos1 holds only Mg and pH; reg is still found through attach(suelos).

str(Zsuelos1)
'data.frame': 100 obs. of 2 variables:
$ Mg: num -0.1017 -0.0381 -0.3064 -0.325 -0.3666 ...
$ pH: num -0.3727 -0.5398 -0.7068 0.1285 -0.0386 ...

head(Zsuelos1)
Mg pH
1 -0.10165732 -0.3727131
2 -0.03814887 -0.5397742
3 -0.30641731 -0.7068352
4 -0.32503186 0.1284699
5 -0.36664084 -0.0385911
6 -0.40715485 -0.0385911

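Fit a regression model that uses categorical variables: dependent variable versus a continuous variable and a categorical variable.
For example:
model=lm(vd~vci+vcai+ vci*vcai,data=zdata)  # tests the main effects of vci and vcai and their interaction
Here vd is the dependent variable, vci the continuous independent variable and vcai the categorical independent variable.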
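In an R formula, a*b already expands to a + b + a:b, so the long form, the short form and the formula used for model5 all describe the same model. A minimal sketch that inspects the expanded terms without fitting anything; the helper etiquetas and the names f_largo, f_corto and f_notas are made up for the illustration.

```r
# Three ways of writing the same model
f_largo <- Mg ~ pH + reg + pH*reg   # main effects written out, plus the product
f_corto <- Mg ~ pH*reg              # the product alone
f_notas <- Mg ~ pH + pH*reg         # the form used for model5

# Helper: the terms R will actually put in the model
etiquetas <- function(f) attr(terms(f), "term.labels")

etiquetas(f_largo)
etiquetas(f_corto)
etiquetas(f_notas)
```

All three calls list pH, reg and the interaction pH:reg, which is why the model5 output contains coefficients for reg even though reg is not written as a separate term.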

model5=lm(Mg~pH+pH*reg,data=Zsuelos1)
model5

Call:
lm(formula = Mg ~ pH + pH * reg, data = Zsuelos1)

Coefficients:
(Intercept)           pH      regRE12       regRE2       regRE3      regRE31   pH:regRE12    pH:regRE2
   -0.34180     -0.05882     -0.02662      0.02046      0.32301     -0.52253      0.05174      0.05961
  pH:regRE3   pH:regRE31
    1.08134      1.42867

summary(model5)

Call:
lm(formula = Mg ~ pH + pH * reg, data = Zsuelos1)

Residuals:
Min 1Q Median 3Q Max
-1.52833 -0.06348 -0.00354 0.06865 1.66790

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) -0.34180 0.15040 -2.273 0.025435 *
pH -0.05882 0.28693 -0.205 0.838025
regRE12 -0.02662 0.22010 -0.121 0.903988
regRE2 0.02046 0.23163 0.088 0.929797
regRE3 0.32301 0.20864 1.548 0.125095
regRE31 -0.52253 0.23375 -2.235 0.027866 *
pH:regRE12 0.05174 0.56414 0.092 0.927128
pH:regRE2 0.05961 0.38132 0.156 0.876132
pH:regRE3 1.08134 0.29625 3.650 0.000439 ***
pH:regRE31 1.42867 0.34500 4.141 7.78e-05 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 0.4416 on 90 degrees of freedom
Multiple R-squared: 0.8227, Adjusted R-squared: 0.805
F-statistic: 46.42 on 9 and 90 DF, p-value: < 2.2e-16
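The interaction coefficients can be read as differences in slope with respect to the reference region. Since the output has no regRE1 term, RE1 is the reference level (R's default treatment coding is assumed here), so the slope of Mg on pH in each region is the pH coefficient plus that region's pH:reg coefficient. A small sketch of the arithmetic with the estimates copied from the output; the names b_pH, interacciones and pendientes are illustrative.

```r
# Estimates copied from the model5 output (z-score units)
b_pH <- -0.05882
interacciones <- c(RE12 = 0.05174, RE2 = 0.05961, RE3 = 1.08134, RE31 = 1.42867)

# RE1 is the reference level, so its slope is the pH coefficient alone;
# every other region adds its own interaction coefficient
pendientes <- c(RE1 = b_pH, b_pH + interacciones)
round(pendientes, 5)
```

Read this way, the slope is close to zero in RE1, RE12 and RE2 and clearly positive in RE3 and RE31, which matches the two interaction terms flagged as significant.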

Statistical summary of the model for each level of the categorical variable

by(datos,vcai,function(x) summary(lm(vd~vci,data=x)))  # general form: x is the subset of rows for each level of vcai

by(Zsuelos1,reg,function(x) summary(lm(Mg~pH,data=x)))  # one summary of Mg ~ pH per region

The model has to be fitted on x. If data=Zsuelos1 is written inside the function, as in

by(Zsuelos1,reg,function(x) summary(lm(Mg~pH,data=Zsuelos1)))

every region is fitted with all 100 plots, and the same summary is printed five times, once under each of reg: RE1, RE12, RE2, RE3 and RE31:

Call:
lm(formula = Mg ~ pH, data = Zsuelos1)

Residuals:
Min 1Q Median 3Q Max
-1.22885 -0.35718 0.01329 0.30265 2.36111

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 3.115e-16 5.655e-02 0.00 1
pH 8.267e-01 5.683e-02 14.55 <2e-16 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 0.5655 on 98 degrees of freedom
Multiple R-squared: 0.6835, Adjusted R-squared: 0.6803
F-statistic: 211.6 on 1 and 98 DF, p-value: < 2.2e-16
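A per-group fit can also be sketched with split and lapply. The example below uses a small invented data set (toy, with groups A and B), not the soil data; the key point is that the model inside the function is fitted on the subset d and not on the full data frame. All names are hypothetical.

```r
# Invented data: y rises with x in group A and falls with x in group B
toy <- data.frame(
  g = rep(c("A", "B"), each = 5),
  x = rep(1:5, times = 2),
  y = c(2.1, 3.9, 6.2, 7.8, 10.0,
        4.9, 4.1, 2.8, 2.2, 0.9)
)

# One model per group: each call sees only the rows of its own group (d)
ajustes <- lapply(split(toy, toy$g), function(d) lm(y ~ x, data = d))
pendientes <- sapply(ajustes, function(m) coef(m)[["x"]])
pendientes

# For contrast: ignoring d and fitting on the full data frame
# returns the same pooled slope for every group
repetidas <- sapply(split(toy, toy$g),
                    function(d) coef(lm(y ~ x, data = toy))[["x"]])
repetidas
```

When every group shows identical residuals, coefficients and degrees of freedom, that is a sign that the function fitted the whole data set instead of the subset it received.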

par(mfrow=c(1,1))  # a single plot per screen

symb=c(19,25,3,23,8)  # choose the symbols you like; one per region, since reg1 takes the values 1 to 5
colors=c("black","red","green","blue","orange")  # choose the colors you like; one per region

plot(pH,Mg,pch=symb[reg1],col=colors[reg1],cex=1.0,xlab="pH",ylab="Mg",main="Plot of pH vs Mg")

[Figure: "Plot of pH vs Mg"; x axis pH from 3.5 to 6.5, y axis Mg from 0 to 5, points marked by region]
