Está en la página 1de 2



Max. Marks : 60
Sub. Code : SE7204

: 95 Min


: I ME(SE)

Sub. Name : Big Data Analytics

Answer ALL Questions
PART A (15 x 2 = 30 Marks)
1.1 What is Big Data?
1.2 Why Intelligent Data Analysis is needed?
1.3 What are the Challenges of Conventional System?
1.4 What is Inferential Analysis?
1.5 List some of the Modern Analytic tools.
1.6 Define statistics and its steps.
1.7 State the Classical definition of Probability with example.
1.8 State the Frequency definition of Probability with example.
1.9 Define Entropy.
1.10 State the Expectation for a discrete random variable.
1.11Differentiate Point Estimation and Interval Estimation
1.12 Give example for Type I and Type II Errors.
1.13 Illustrate prediction error with example.
1.14 In Classification how prediction error is calculated.
1.15 List the applications of Cross Validation.
Answer ALL Questions
PART B (2 x 15=30 Marks)
Question No. 16 is Compulsory
1.16 Explain statistical inference methods with a simple data set.
1.17 a) (i) The Acme Widget Company claims that their widgets last 5 years, with a

standard deviation of 1 year. Assume that their claims are true. If you test a random
sample of 9 Acme widgets, what is the probability that the standard deviation in your
sample will be less than 0.95 years?

We know the following:

The population standard deviation is equal to 1.

The sample standard deviation is equal to 0.95.

The sample size is equal to 9.

The degrees of freedom is equal to 8 (because sample size minus one = 9 - 1 = 8).

Given these data, we compute the chi-square statistic:

2 = [ ( n - 1 ) * s2 ] / 2

= [ ( 9 - 1 ) * (0.95)2 ] / (1.0)2 = 7.22

where is the standard deviation of the population, s is the standard deviation of the
sample, and n is the sample size.
Now, using the Chi-Square Distribution Calculator, we can determine the cumulative
probability for the chi-square statistic. We enter the degrees of freedom (8) and the
chi-square statistic (7.22) into the calculator, and hit the Calculate button. The
calculator reports that the cumulative probability is 0.49. Therefore, there is a 49%
chance that the sample standard deviation will be no more than 0.95

(ii) Explain Prediction Error Methods in detail.

(b) Explain in detail about Re-sampling Methods.