CBSE Explorer

Statistics

AI Learning Assistant

I can help you understand Statistics better. Ask me anything!

Summarize the main points of Statistics.
What are the most important terms to remember here?
Explain this concept like I'm five.
Give me a quick 3-question practice quiz.

Summary

Summary of Statistics

Measures of Central Tendency

  • Mean: Average of all observations.
  • Median: Middle value that divides the data into two halves.
  • Mode: Most frequently occurring value in the dataset.

Finding Mean

  • Example 1: Mean heartbeats per minute for women.
  • Example 2: Mean daily expenditure on food.
  • Example 3: Mean concentration of SOâ‚‚ in air.

Finding Median

  • Formula:
    Median = l + (n/2 - cf) * h / f
    Where:
    • l = lower limit of median class
    • n = total number of observations
    • cf = cumulative frequency of class preceding median class
    • f = frequency of median class
    • h = class size

Finding Mode

  • Formula:
    Mode = l + [(f1 - f0) / (2f1 - f0 - f2)] * h
    Where:
    • l = lower limit of modal class
    • f1 = frequency of modal class
    • f0 = frequency of class preceding modal class
    • f2 = frequency of class succeeding modal class
    • h = class size

Cumulative Frequency

  • Definition: The total frequency accumulated up to a certain point in the dataset.
  • Example: Cumulative frequency table for marks obtained by students.

Common Pitfalls

  • Mean can be skewed by extreme values.
  • Median is preferred in skewed distributions.
  • Mode is useful for identifying the most common value but may not represent the data well if there are multiple modes.

Tips

  • Always check the distribution shape before choosing a measure of central tendency.
  • Use cumulative frequency for finding medians in grouped data.

Learning Objectives

  • Learning Objectives
    • Understand how to collect and organize data into grouped frequency distributions.
    • Calculate the mean from grouped data using appropriate methods.
    • Interpret the results of mean calculations in the context of real-world data.
    • Identify and apply the mode and median in grouped data distributions.
    • Analyze the impact of extreme values on the mean, median, and mode.
    • Differentiate between the use of mean, median, and mode based on the nature of the data.

Detailed Notes

Statistics Notes

Measures of Central Tendency

Mean

  • The mean is calculated using the formula:
    x=ΣfixiΣfix = \frac{\Sigma f_i x_i}{\Sigma f_i}
  • Example: For a grouped frequency distribution, the mean can be calculated by forming class intervals and using the mid-points as representative values.

Median

  • The median is calculated using the formula:
    Median=l+(n2−cff)×h\text{Median} = l + \left( \frac{\frac{n}{2} - cf}{f} \right) \times h
    • Where:
      • ll: Lower limit of the median class
      • nn: Total number of observations
      • cfcf: Cumulative frequency of the class preceding the median class
      • ff: Frequency of the median class
      • hh: Class width

Mode

  • The mode is calculated using the formula:
    Mode=l+(f1−f02f1−f0−f2)×h\text{Mode} = l + \left( \frac{f_1 - f_0}{2f_1 - f_0 - f_2} \right) \times h
    • Where:
      • ll: Lower limit of the modal class
      • f1f_1: Frequency of the modal class
      • f0f_0: Frequency of the class preceding the modal class
      • f2f_2: Frequency of the class succeeding the modal class
      • hh: Class width

Cumulative Frequency

  • Cumulative frequency is the sum of the frequencies of all classes preceding a given class.

Example Tables

Example 1: Marks Distribution

Marks obtainedNumber of studentsCumulative frequency
0 - 1055
10 - 2038
20 - 30412
30 - 40315
40 - 50318
50 - 60422
60 - 70729
70 - 80938
80 - 90745
90 - 100853

Example 2: Teacher-Student Ratio

Number of students per teacherNumber of states / U.T.
15 - 203
20 - 258
25 - 309
30 - 3510
35 - 403
40 - 450
45 - 500
50 - 552

Remarks

  1. The mean is sensitive to extreme values, while the median is more robust in such cases.
  2. The mode is useful for identifying the most frequent value in a dataset.
  3. There is an empirical relationship:
    3Median=Mode+2Mean3 \text{Median} = \text{Mode} + 2 \text{Mean}

Applications

  • Mean is used for comparing distributions.
  • Median is preferred when extreme values are present.
  • Mode is used to find the most popular or frequent item.

Exam Tips & Common Mistakes

Common Mistakes and Exam Tips

Common Pitfalls

  • Misunderstanding Mean Calculation: Students often confuse the methods for calculating the mean, especially between direct, assumed mean, and step-deviation methods. Ensure you understand when to use each method.
  • Ignoring Class Intervals: When dealing with grouped data, failing to recognize the importance of class intervals can lead to incorrect calculations of mean, median, and mode.
  • Cumulative Frequency Errors: Mistakes in calculating cumulative frequencies can affect the determination of median and other statistical measures.
  • Assuming Midpoints: In grouped data, assuming that all values are at the midpoint can lead to inaccuracies. Always consider the range of the class interval.

Tips for Avoiding Mistakes

  • Double-Check Calculations: Always verify your calculations, especially when summing frequencies or products in mean calculations.
  • Understand the Data Structure: Familiarize yourself with the data structure (ungrouped vs grouped) and the appropriate methods for each.
  • Practice with Examples: Work through multiple examples to solidify your understanding of different statistical measures and their calculations.
  • Use Tables Effectively: Organize data in tables to visualize frequencies, cumulative frequencies, and calculations clearly.
  • Clarify Definitions: Ensure you understand definitions of mean, median, and mode, and how they apply to different types of data.

Practice & Assessment

Multiple Choice Questions

A.

5

B.

15

C.

25

D.

35
Correct Answer: B

Solution:

The mode is the interval with the highest frequency. Here, the interval 10-20 has the highest frequency of 3. Therefore, the mode is the midpoint of this interval, which is 15.

A.

75.25

B.

75.50

C.

76.00

D.

74.75
Correct Answer: A

Solution:

The total score with the incorrect entry is 40×75=300040 \times 75 = 3000. Correcting the score, the total becomes 3000−85+95=30103000 - 85 + 95 = 3010. The correct mean is 301040=75.25\frac{3010}{40} = 75.25.

A.

1-4

B.

4-7

C.

7-10

D.

10-13
Correct Answer: C

Solution:

The mode is the class with the highest frequency. Here, the class 7-10 has the highest frequency of 40, making it the modal class.

A.

8.5

B.

10

C.

11.5

D.

12
Correct Answer: C

Solution:

To calculate the mean, we use the formula: xˉ=ΣfixiΣfi\bar{x} = \frac{\Sigma f_i x_i}{\Sigma f_i}. The midpoints of the intervals are 2.5, 7.5, 12.5, and 17.5. Thus, Σfixi=(3×2.5)+(7×7.5)+(15×12.5)+(5×17.5)=7.5+52.5+187.5+87.5=335\Sigma f_i x_i = (3 \times 2.5) + (7 \times 7.5) + (15 \times 12.5) + (5 \times 17.5) = 7.5 + 52.5 + 187.5 + 87.5 = 335. The total frequency Σfi=3+7+15+5=30\Sigma f_i = 3 + 7 + 15 + 5 = 30. Therefore, the mean is xˉ=33530=11.5\bar{x} = \frac{335}{30} = 11.5.

A.

25-35 years

B.

35-45 years

C.

45-55 years

D.

55-65 years
Correct Answer: B

Solution:

The mode is the class interval with the highest frequency, which is 35-45 years with 23 patients.

A.

2750 hours

B.

3250 hours

C.

3750 hours

D.

4250 hours
Correct Answer: B

Solution:

To find the median, we need to determine the cumulative frequency and locate the median class. The median class is 3000-3500 hours, as it contains the 200th lamp (n/2 = 200). Using the formula for median, the median life time is 3250 hours.

A.

400 hours

B.

450 hours

C.

475 hours

D.

500 hours
Correct Answer: B

Solution:

To find the median, we determine the cumulative frequency and locate the median class. The cumulative frequencies are 10, 40, 80, 95, 100. The median class is 400-500 hours. Using the median formula: Median=l+(n2−cff)×h\text{Median} = l + \left(\frac{\frac{n}{2} - cf}{f}\right) \times h where l=400l = 400, cf=40cf = 40, f=40f = 40, h=100h = 100, and n=100n = 100. Substituting these values, we get Median=400+(50−4040)×100=450\text{Median} = 400 + \left(\frac{50 - 40}{40}\right) \times 100 = 450 hours.

A.

150-160 cm

B.

160-170 cm

C.

170-180 cm

D.

140-150 cm
Correct Answer: B

Solution:

The total number of students is 3 + 5 + 12 + 8 + 2 = 30. The median class is the one where the cumulative frequency is greater than or equal to 30/2 = 15. The cumulative frequency reaches 20 at the class 160-170 cm, making it the median class.

A.

-1

B.

-0.5

C.

0.5

D.

1
Correct Answer: A

Solution:

The standard deviation is the square root of the variance, which is 4=2\sqrt{4} = 2. The deviation from the mean in standard deviation units is 3−52=−1\frac{3 - 5}{2} = -1.

A.

₹540

B.

₹550

C.

₹560

D.

₹520
Correct Answer: A

Solution:

The formula for the median is: Median=l+(n2−cff)×h\text{Median} = l + \left( \frac{\frac{n}{2} - cf}{f} \right) \times h. Here, the median class is ₹520-540. Thus, l=520,n=50,cf=12,f=14,h=20l = 520, n = 50, cf = 12, f = 14, h = 20. Substituting these values, we get the median as ₹540.

A.

20-30 cars

B.

30-40 cars

C.

40-50 cars

D.

50-60 cars
Correct Answer: C

Solution:

The mode is the class interval with the highest frequency. Here, the class 40-50 cars has the highest frequency of 20 periods, making it the mode.

A.

27.5 cm

B.

30 cm

C.

32.5 cm

D.

35 cm
Correct Answer: A

Solution:

To calculate the mean, use the formula: xˉ=ΣfixiΣfi\bar{x} = \frac{\Sigma f_i x_i}{\Sigma f_i}. The midpoints are 15, 25, 35, and 45. Thus, Σfixi=(4×15)+(6×25)+(8×35)+(2×45)=60+150+280+90=580\Sigma f_i x_i = (4 \times 15) + (6 \times 25) + (8 \times 35) + (2 \times 45) = 60 + 150 + 280 + 90 = 580. The total frequency Σfi=4+6+8+2=20\Sigma f_i = 4 + 6 + 8 + 2 = 20. Therefore, the mean is xˉ=58020=29\bar{x} = \frac{580}{20} = 29 cm.

A.

30 years

B.

35 years

C.

40 years

D.

45 years
Correct Answer: B

Solution:

The median is the age at which half the policyholders are younger and half are older. With 100 policyholders, the median age is the age at which the cumulative frequency first exceeds 50. The cumulative frequency exceeds 50 at the 'Below 35' category, making the median age 35 years.

A.

₹540

B.

₹550

C.

₹560

D.

₹570
Correct Answer: B

Solution:

The mean is calculated using the formula: Mean=ΣfixiΣfi\text{Mean} = \frac{\Sigma f_i x_i}{\Sigma f_i}. Here, the mean is calculated as: (510×12)+(530×14)+(550×8)+(570×6)+(590×10)50=550\frac{(510 \times 12) + (530 \times 14) + (550 \times 8) + (570 \times 6) + (590 \times 10)}{50} = 550.

A.

₹200

B.

₹220

C.

₹240

D.

₹260
Correct Answer: B

Solution:

The mean daily expenditure is calculated as (125×4)+(175×5)+(225×12)+(275×2)+(325×2)25=₹220\frac{(125 \times 4) + (175 \times 5) + (225 \times 12) + (275 \times 2) + (325 \times 2)}{25} = ₹220.

A.

125-145 units

B.

105-125 units

C.

145-165 units

D.

165-185 units
Correct Answer: A

Solution:

The mode is the class interval with the highest frequency. Here, the class interval 125-145 has the highest frequency of 20.

A.

10 hours

B.

12.5 hours

C.

15 hours

D.

11.5 hours
Correct Answer: B

Solution:

The mean is calculated as: Mean=5×2.5+10×7.5+15×12.5+10×17.55+10+15+10=12.5 hours\text{Mean} = \frac{5 \times 2.5 + 10 \times 7.5 + 15 \times 12.5 + 10 \times 17.5}{5 + 10 + 15 + 10} = 12.5 \text{ hours}.

A.

₹200

B.

₹220

C.

₹240

D.

₹260
Correct Answer: A

Solution:

Using the direct method, the mean is calculated as follows: xˉ=ΣfixiΣfi\bar{x} = \frac{\Sigma f_i x_i}{\Sigma f_i}. The mean daily expenditure is ₹200.

A.

10.5 hours

B.

12.5 hours

C.

8.5 hours

D.

11.5 hours
Correct Answer: A

Solution:

To find the mean, we use the formula: Mean=∑fixi∑fi\text{Mean} = \frac{\sum f_i x_i}{\sum f_i}. The midpoints are 2.5, 7.5, 12.5, and 17.5. The mean is (3×2.5)+(7×7.5)+(15×12.5)+(5×17.5)3+7+15+5=10.5\frac{(3 \times 2.5) + (7 \times 7.5) + (15 \times 12.5) + (5 \times 17.5)}{3 + 7 + 15 + 5} = 10.5 hours.

A.

60

B.

62.5

C.

65

D.

67.5
Correct Answer: B

Solution:

After removing the highest score (90), the scores are: 45, 50, 55, 60, 65, 70, 75, 80, 85. The median is the average of the 4th and 5th scores: (60 + 65) / 2 = 62.5.

A.

40-50

B.

10-20

C.

60-70

D.

20-30
Correct Answer: A

Solution:

The mode is the class interval with the highest frequency. Here, the interval 40-50 has the highest frequency of 20.

A.

₹2250

B.

₹2500

C.

₹2750

D.

₹3000
Correct Answer: A

Solution:

To find the mean, we calculate the sum of the midpoints of each class interval multiplied by their respective frequencies, divided by the total number of families. The mean monthly expenditure is approximately ₹2250.

A.

132 units

B.

142 units

C.

152 units

D.

162 units
Correct Answer: B

Solution:

Calculate the midpoint of each class interval, multiply by the frequency, sum these products, and divide by the total number of consumers. The mean is approximately 142 units.

A.

25-35

B.

35-45

C.

15-25

D.

45-55
Correct Answer: B

Solution:

The mode is the class interval with the highest frequency. Here, the interval 35-45 has the highest frequency of 23.

A.

40 years

B.

35 years

C.

45 years

D.

50 years
Correct Answer: A

Solution:

The mean age is calculated as: Mean=10×25+15×35+20×45+5×5510+15+20+5=40 years\text{Mean} = \frac{10 \times 25 + 15 \times 35 + 20 \times 45 + 5 \times 55}{10 + 15 + 20 + 5} = 40 \text{ years}.

A.

₹60,000

B.

₹65,000

C.

₹67,500

D.

₹70,000
Correct Answer: B

Solution:

The profits in increasing order are ₹50,000, ₹55,000, ₹60,000, ₹65,000, ₹70,000, ₹80,000. The median is the average of the 3rd and 4th values: (₹60,000 + ₹65,000)/2 = ₹62,500.

A.

₹550

B.

₹540

C.

₹530

D.

₹560
Correct Answer: B

Solution:

Mean daily wage is calculated by multiplying the mid-point of each wage interval by the number of workers and dividing by the total number of workers. Mean = (510×12 + 530×14 + 550×8 + 570×6 + 590×10) / 50 = ₹540.

A.

3250 hours

B.

3500 hours

C.

3750 hours

D.

4000 hours
Correct Answer: B

Solution:

The median is the value that divides the distribution into two equal parts. To find the median class, calculate the cumulative frequency and identify the class where the cumulative frequency exceeds half of the total frequency (200). The cumulative frequency for the class 3000-3500 is 216, which is the first to exceed 200. Using the median formula: Median = l + ((n/2 - cf)/f) * h, where l = 3000, cf = 130, f = 86, h = 500, we get Median = 3000 + ((200 - 130)/86) * 500 = 3500 hours.

A.

165 cm

B.

167 cm

C.

162 cm

D.

160 cm
Correct Answer: C

Solution:

To find the mean, we use the formula: xˉ=ΣfixiΣfi\bar{x} = \frac{\Sigma f_i x_i}{\Sigma f_i} where fif_i is the frequency and xix_i is the midpoint of each class interval. The midpoints are 145, 155, 165, 175, and 185. Calculating the sum of fixif_i x_i gives 8100, and the sum of frequencies is 50. Thus, xˉ=810050=162\bar{x} = \frac{8100}{50} = 162 cm.

A.

20-40

B.

40-60

C.

60-80

D.

0-20
Correct Answer: B

Solution:

The mean score is calculated by finding the mid-point of each class and multiplying by the frequency, then dividing by the total number of students. The class with the highest contribution to the mean is 40-60.

A.

56.5 mangoes

B.

58.5 mangoes

C.

60.5 mangoes

D.

62.5 mangoes
Correct Answer: B

Solution:

To find the mean, multiply the mid-point of each class interval by the frequency, sum these products, and divide by the total number of boxes. Mean = (51×15)+(54×110)+(57×135)+(60×115)+(63×25)400=58.5\frac{(51 \times 15) + (54 \times 110) + (57 \times 135) + (60 \times 115) + (63 \times 25)}{400} = 58.5 mangoes.

A.

4 books

B.

3.5 books

C.

5 books

D.

4.5 books
Correct Answer: D

Solution:

The mean is calculated as: Mean=3×1+5×3+7×5+5×73+5+7+5=4.5 books\text{Mean} = \frac{3 \times 1 + 5 \times 3 + 7 \times 5 + 5 \times 7}{3 + 5 + 7 + 5} = 4.5 \text{ books}.

A.

50

B.

30

C.

70

D.

40
Correct Answer: D

Solution:

The mode is the value that appears most frequently. In this grouped data, the class interval with the highest frequency is 40-60 with 20 students. Therefore, the mode is the midpoint of this interval, which is 50.

A.

4 hours

B.

5 hours

C.

6 hours

D.

4.5 hours
Correct Answer: D

Solution:

The mean is calculated as: Mean=8×1+12×3+10×5+10×78+12+10+10=4.5 hours\text{Mean} = \frac{8 \times 1 + 12 \times 3 + 10 \times 5 + 10 \times 7}{8 + 12 + 10 + 10} = 4.5 \text{ hours}.

A.

100-150 widgets

B.

150-200 widgets

C.

200-250 widgets

D.

250-300 widgets
Correct Answer: C

Solution:

The mode is the class interval with the highest frequency. Here, the class 200-250 widgets has the highest frequency of 4 days, making it the mode.

A.

₹2000-2500

B.

₹2500-3000

C.

₹3000-3500

D.

₹1500-2000
Correct Answer: D

Solution:

The modal class is the one with the highest frequency. Here, the class interval with the highest frequency is 1500-2000 with 40 families.

A.

4000-5000 runs

B.

5000-6000 runs

C.

6000-7000 runs

D.

7000-8000 runs
Correct Answer: A

Solution:

The mode is the class interval with the highest frequency. Here, the interval 4000-5000 runs has the highest frequency of 18.

A.

10.5 days

B.

12.5 days

C.

14.5 days

D.

16.5 days
Correct Answer: C

Solution:

Mean number of days absent is calculated by multiplying the mid-point of each interval by the number of students and dividing by the total number of students. Mean = (3×11 + 8×10 + 12×7 + 17×4 + 24×4 + 33×3 + 39×1) / 40 = 14.5 days.

A.

0-20 hours

B.

40-60 hours

C.

60-80 hours

D.

80-100 hours
Correct Answer: C

Solution:

The mode is the class interval with the highest frequency. Here, the class 60-80 hours has the highest frequency of 61.

A.

₹35000

B.

₹37500

C.

₹40000

D.

₹42500
Correct Answer: C

Solution:

Initially, the sum of incomes is ₹25000 + ₹30000 + ₹35000 + ₹40000 + ₹45000 = ₹175000. Adding the new income: ₹175000 + ₹50000 = ₹225000. The new mean is ₹225000 / 6 = ₹37500.

A.

₹550

B.

₹540

C.

₹560

D.

₹570
Correct Answer: B

Solution:

The mean is calculated using the formula: xˉ=ΣfixiΣfi\bar{x} = \frac{\Sigma f_i x_i}{\Sigma f_i}. Here, Σfixi=(500×12)+(520×14)+(540×8)+(560×6)+(580×10)=27000\Sigma f_i x_i = (500 \times 12) + (520 \times 14) + (540 \times 8) + (560 \times 6) + (580 \times 10) = 27000 and Σfi=50\Sigma f_i = 50. Therefore, mean = 27000/50 = ₹540.

A.

5

B.

6

C.

7

D.

8
Correct Answer: A

Solution:

Using the mean formula: Mean = (Σfᵢxᵢ) / Σfᵢ, and given that the mean is ₹18, solve for f. Calculate the total sum of allowances with the midpoints and solve for f to get f = 5.

A.

7

B.

8

C.

9

D.

10
Correct Answer: C

Solution:

The mode is the value that appears most frequently in a data set. In this data set, the number 9 appears most frequently (five times). Therefore, the mode is 9.

A.

30

B.

40

C.

50

D.

60
Correct Answer: A

Solution:

The frequency of an interval can be found by subtracting the cumulative frequency of the previous interval from the cumulative frequency of the current interval. Therefore, the frequency of the interval 40-50 is 120 - 90 = 30.

A.

Direct method

B.

Assumed mean method

C.

Step deviation method

D.

Convert to continuous classes first, then use the median formula
Correct Answer: D

Solution:

To accurately calculate the median for grouped data, the class intervals must be continuous. Therefore, converting to continuous classes first is necessary before applying the median formula.

A.

32.5 years

B.

37.5 years

C.

42.5 years

D.

47.5 years
Correct Answer: B

Solution:

The median is the age at which the cumulative frequency is greater than or equal to 50 (since 100/2 = 50). The cumulative frequency just greater than 50 is 78, which corresponds to the class 35-40. Using the median formula, we find the median to be approximately 37.5 years.

A.

165 cm

B.

160 cm

C.

170 cm

D.

175 cm
Correct Answer: A

Solution:

The mean height is calculated as follows: Mean=∑fixi∑fi=5×145+10×155+15×165+10×175+5×1855+10+15+10+5=165 cm\text{Mean} = \frac{\sum f_i x_i}{\sum f_i} = \frac{5 \times 145 + 10 \times 155 + 15 \times 165 + 10 \times 175 + 5 \times 185}{5 + 10 + 15 + 10 + 5} = 165 \text{ cm}.

A.

175 units

B.

225 units

C.

250 units

D.

200 units
Correct Answer: B

Solution:

To find the median, we first determine the cumulative frequencies: 5, 15, 30, 50. The total number of households is 50, so the median class is where the cumulative frequency exceeds 25. This occurs in the 200-250 units interval. Using the formula for median: Median=l+(n2−cff)×h\text{Median} = l + \left(\frac{\frac{n}{2} - cf}{f}\right) \times h, where l=200l = 200, cf=15cf = 15, f=15f = 15, h=50h = 50. Therefore, Median = 200+(25−1515)×50=225200 + \left(\frac{25 - 15}{15}\right) \times 50 = 225 units.

A.

0.10 ppm

B.

0.12 ppm

C.

0.08 ppm

D.

0.06 ppm
Correct Answer: C

Solution:

The mean concentration is calculated as: (0.02×4)+(0.06×9)+(0.10×9)+(0.14×2)+(0.18×4)+(0.22×2)30=0.08\frac{(0.02 \times 4) + (0.06 \times 9) + (0.10 \times 9) + (0.14 \times 2) + (0.18 \times 4) + (0.22 \times 2)}{30} = 0.08 ppm.

A.

25

B.

26.67

C.

27.5

D.

28.33
Correct Answer: B

Solution:

The median is calculated using the formula: Median=l+n2−cff×h\text{Median} = l + \frac{\frac{n}{2} - cf}{f} \times h. Here, l=20l = 20, n=45n = 45, cf=30cf = 30, f=15f = 15, and h=10h = 10. Substituting these values, Median=20+22.5−3015×10=26.67\text{Median} = 20 + \frac{22.5 - 30}{15} \times 10 = 26.67.

A.

Approximately 68%

B.

Approximately 95%

C.

Approximately 99.7%

D.

Approximately 50%
Correct Answer: A

Solution:

In a normal distribution, approximately 68% of the data falls within one standard deviation of the mean. Since 2850 to 3150 hours is one standard deviation from the mean (3000 ± 150), the probability is approximately 68%.

A.

162.5 cm

B.

163.0 cm

C.

163.5 cm

D.

164.0 cm
Correct Answer: B

Solution:

To find the mean, we use the formula: xˉ=ΣfixiΣfi\bar{x} = \frac{\Sigma f_i x_i}{\Sigma f_i}. The midpoints of the class intervals are 152.5, 157.5, 162.5, 167.5, and 172.5. Calculating the sum of products of frequencies and midpoints: 5×152.5+10×157.5+15×162.5+10×167.5+10×172.5=8152.55 \times 152.5 + 10 \times 157.5 + 15 \times 162.5 + 10 \times 167.5 + 10 \times 172.5 = 8152.5. The total frequency is 50. Therefore, the mean height is 8152.550=163.0\frac{8152.5}{50} = 163.0 cm.

A.

18

B.

22

C.

15

D.

12
Correct Answer: B

Solution:

The cumulative frequency for the class 50-60 is obtained by adding the frequencies of all preceding classes: 5 + 3 + 4 + 3 + 3 = 18. Adding the frequency of the 50-60 class, we get 18 + 4 = 22.

A.

7.5

B.

8.5

C.

6.5

D.

5.5
Correct Answer: A

Solution:

To find the mean, calculate the mid-point of each interval and multiply by the corresponding frequency, then divide by the total number of houses: Mean=(1×1)+(3×2)+(5×1)+(7×5)+(9×6)+(11×2)+(13×3)20=7.5\text{Mean} = \frac{(1 \times 1) + (3 \times 2) + (5 \times 1) + (7 \times 5) + (9 \times 6) + (11 \times 2) + (13 \times 3)}{20} = 7.5.

A.

50

B.

55

C.

60

D.

65
Correct Answer: A

Solution:

The total number of students is 30. The median is the score at the 15.5th position. The cumulative frequency table is: 0-20: 4, 20-40: 10, 40-60: 20, 60-80: 28, 80-100: 30. The median class is 40-60. Using the median formula: Median=l+(n2−cff)×h\text{Median} = l + \left(\frac{\frac{n}{2} - cf}{f}\right) \times h, where l=40l = 40, cf=10cf = 10, f=10f = 10, h=20h = 20. Therefore, Median = 40+(15−1010)×20=5040 + \left(\frac{15 - 10}{10}\right) \times 20 = 50.

A.

74.5

B.

76.5

C.

78.5

D.

80.5
Correct Answer: B

Solution:

Calculate the mean by multiplying the mid-point of each class interval by the number of women, summing these products, and dividing by the total number of women: (66.5×2)+(69.5×4)+(72.5×3)+(75.5×8)+(78.5×7)+(81.5×4)+(84.5×2)30=76.5\frac{(66.5 \times 2) + (69.5 \times 4) + (72.5 \times 3) + (75.5 \times 8) + (78.5 \times 7) + (81.5 \times 4) + (84.5 \times 2)}{30} = 76.5.

A.

7-10

B.

4-7

C.

10-13

D.

13-16
Correct Answer: A

Solution:

The median class is the one where the cumulative frequency reaches or exceeds half of the total number of observations. Here, the cumulative frequency for 7-10 letters is 76, which is the median class.

A.

5.5

B.

6.5

C.

7.5

D.

8.5
Correct Answer: A

Solution:

The mean number of plants is calculated by multiplying the mid-point of each interval by the number of houses and dividing by the total number of houses. Mean = (1×1 + 3×2 + 5×1 + 7×5 + 9×6 + 11×2 + 13×3) / 20 = 5.5.

A.

65%

B.

70%

C.

75%

D.

80%
Correct Answer: B

Solution:

To find the mean literacy rate, calculate the sum of the midpoints of each class interval multiplied by their respective frequencies, divided by the total number of cities. The mean literacy rate is approximately 70%.

A.

₹2500-3000

B.

₹1500-2000

C.

₹3000-3500

D.

₹2000-2500
Correct Answer: B

Solution:

The mode is the class interval with the highest frequency. Here, the class ₹1500-2000 has the highest frequency of 40 families, making it the modal class.

A.

52.5 kg

B.

57.5 kg

C.

62.5 kg

D.

67.5 kg
Correct Answer: B

Solution:

To find the median, we determine the cumulative frequency and locate the median class. The median class is 55-60 kg, as it contains the 15th student (n/2 = 15). Using the median formula, the median weight is 57.5 kg.

A.

1-4 letters

B.

4-7 letters

C.

7-10 letters

D.

10-13 letters
Correct Answer: C

Solution:

The mode is the value that appears most frequently. Here, the highest frequency is 40, corresponding to the 7-10 letters interval. Therefore, the modal size is 7-10 letters.

A.

3250 hours

B.

3500 hours

C.

3000 hours

D.

3750 hours
Correct Answer: C

Solution:

The median class is the one where the cumulative frequency reaches 200 (half of 400). This occurs in the 2500-3000 hours class, making the median life time 3000 hours.

A.

20-30 cars

B.

30-40 cars

C.

40-50 cars

D.

50-60 cars
Correct Answer: C

Solution:

The mode is the class interval with the highest frequency. Here, the class 40-50 cars has the highest frequency of 20.

A.

10.5 days

B.

12 days

C.

15 days

D.

9 days
Correct Answer: A

Solution:

To find the mean, multiply the mid-point of each class interval by the frequency, sum these products, and divide by the total number of students. Mean = (3×11)+(8×10)+(12×7)+(17×4)+(24×4)+(33×3)+(39×1)40=10.5\frac{(3 \times 11) + (8 \times 10) + (12 \times 7) + (17 \times 4) + (24 \times 4) + (33 \times 3) + (39 \times 1)}{40} = 10.5 days.

A.

₹20,833

B.

₹21,000

C.

₹22,500

D.

₹23,000
Correct Answer: A

Solution:

The mean is calculated using: xˉ=ΣfixiΣfi\bar{x} = \frac{\Sigma f_i x_i}{\Sigma f_i} where xix_i is the midpoint of each class interval. The midpoints are ₹12,500, ₹17,500, ₹22,500, ₹27,500, and ₹32,500. Calculating the sum of fixif_i x_i gives ₹1,250,000, and the sum of frequencies is 60. Thus, xˉ=1,250,00060=₹20,833\bar{x} = \frac{1,250,000}{60} = ₹20,833.

A.

55 kg

B.

57.5 kg

C.

60 kg

D.

62.5 kg
Correct Answer: B

Solution:

The median class is the one where the cumulative frequency reaches 15.5 (half of 30). This occurs in the 55-60 kg class, making the median weight 57.5 kg.

A.

68%

B.

70%

C.

72%

D.

74%
Correct Answer: B

Solution:

To find the mean, calculate the mid-point of each class and multiply by the frequency, then divide by the total number of cities. Mean = (503 + 6010 + 7011 + 808 + 90*3) / 35 = 70%.

A.

2750 hours

B.

3250 hours

C.

3750 hours

D.

4250 hours
Correct Answer: B

Solution:

To find the median, calculate the cumulative frequency and locate the median class. The cumulative frequency just over 200 (half of 400) is 216 in the class 3000-3500, making it the median class. Using the formula, the median is calculated as 3250 hours.

A.

14

B.

15

C.

16

D.

17
Correct Answer: B

Solution:

To find the median, arrange the data in ascending order: 10, 12, 14, 15, 16, 18, 20. The median is the middle value, which is 15.

A.

4

B.

5

C.

6

D.

7
Correct Answer: A

Solution:

The mean is calculated as the total number of plants divided by the number of houses. Total plants = (0 \times 1) + (2 \times 2) + (4 \times 1) + (6 \times 5) + (8 \times 6) + (10 \times 2) + (12 \times 3) = 80. Number of houses = 20. Mean = 80/20 = 4.

A.

55.5 mangoes

B.

56.5 mangoes

C.

57.5 mangoes

D.

58.5 mangoes
Correct Answer: C

Solution:

Mean number of mangoes = (5115 + 54110 + 57135 + 60115 + 63*25) / 400 = 57.5 mangoes.

A.

550

B.

540

C.

530

D.

560
Correct Answer: B

Solution:

Calculate the mid-point of each wage interval and multiply by the frequency, then divide by the total number of workers: Mean=(510×12)+(530×14)+(550×8)+(570×6)+(590×10)50=540\text{Mean} = \frac{(510 \times 12) + (530 \times 14) + (550 \times 8) + (570 \times 6) + (590 \times 10)}{50} = 540.

A.

32.5 years

B.

35 years

C.

37.5 years

D.

40 years
Correct Answer: C

Solution:

The median is the age at which half the policy holders are younger and half are older. With 100 policy holders, the median is the 50th policy holder. The cumulative frequency just above 50 is 78, which corresponds to the age group 35-40. Therefore, the median age is approximately 37.5 years.

A.

145.5 mm

B.

146.5 mm

C.

147.5 mm

D.

148.5 mm
Correct Answer: B

Solution:

The median class is 145-153 mm. Using the median formula, the median length is 146.5146.5 mm.

A.

10 books

B.

12.5 books

C.

15 books

D.

7.5 books
Correct Answer: B

Solution:

To find the median, we determine the cumulative frequency and locate the median class. The cumulative frequencies are 5, 20, 40, 50. The median class is 10-15 books. Using the median formula: Median=l+(n2−cff)×h\text{Median} = l + \left(\frac{\frac{n}{2} - cf}{f}\right) \times h where l=10l = 10, cf=20cf = 20, f=20f = 20, h=5h = 5, and n=50n = 50. Substituting these values, we get Median=10+(25−2020)×5=12.5\text{Median} = 10 + \left(\frac{25 - 20}{20}\right) \times 5 = 12.5 books.

A.

5

B.

6

C.

7

D.

8
Correct Answer: B

Solution:

Using the formula for mean: xˉ=ΣfixiΣfi\bar{x} = \frac{\Sigma f_i x_i}{\Sigma f_i}. Given mean = 18, solve for ff using the equation: 18=(12×7)+(14×6)+(16×9)+(18×13)+(20×f)+(22×5)+(24×4)44+f18 = \frac{(12 \times 7) + (14 \times 6) + (16 \times 9) + (18 \times 13) + (20 \times f) + (22 \times 5) + (24 \times 4)}{44 + f}. Solving gives f=6f = 6.

A.

The distribution is symmetric.

B.

The distribution is positively skewed.

C.

The distribution is negatively skewed.

D.

The skewness cannot be determined.
Correct Answer: B

Solution:

In a positively skewed distribution, the mode is less than the mean. Here, since the mode (150 grams) is greater than the mean (145 grams), the distribution is positively skewed.

A.

₹1750

B.

₹2000

C.

₹2250

D.

₹2500
Correct Answer: B

Solution:

The mean is calculated as: (10 * 1250 + 15 * 1750 + 20 * 2250 + 5 * 2750) / (10 + 15 + 20 + 5) = ₹2000.

A.

0.1126

B.

0.1251

C.

0.0996

D.

0.1353
Correct Answer: B

Solution:

The probability of observing kk events in a Poisson distribution is given by P(k;λ)=e−λλkk!P(k; \lambda) = \frac{e^{-\lambda} \lambda^k}{k!}. For λ=10\lambda = 10 and k=8k = 8, P(8;10)=e−101088!≈0.1251P(8; 10) = \frac{e^{-10} 10^8}{8!} \approx 0.1251.

A.

65%

B.

70%

C.

75%

D.

80%
Correct Answer: B

Solution:

Calculate the mean by multiplying the mid-point of each class interval by the number of cities, summing these products, and dividing by the total number of cities: (50×3)+(60×10)+(70×11)+(80×8)+(90×3)35=70\frac{(50 \times 3) + (60 \times 10) + (70 \times 11) + (80 \times 8) + (90 \times 3)}{35} = 70.

A.

1.5

B.

2.0

C.

2.5

D.

3.0
Correct Answer: B

Solution:

The z-score is calculated using the formula: z=x−μσz = \frac{x - \mu}{\sigma}, where xx is the student's height, μ\mu is the mean, and σ\sigma is the standard deviation. Substituting the given values, z=180−16010=2.0z = \frac{180 - 160}{10} = 2.0.

True or False

Correct Answer: False

Solution:

For calculating the median of grouped data, class intervals must be continuous to apply the median formula correctly.

Correct Answer: True

Solution:

The median is less affected by extreme values and provides a better measure of central tendency in such cases.

Correct Answer: False

Solution:

The mean is not always the best measure of central tendency when there are extreme values, as it can be heavily influenced by them. In such cases, the median is often a better measure.

Correct Answer: True

Solution:

Cumulative frequency distribution helps in determining the count of observations below a certain threshold.

Correct Answer: True

Solution:

The mode identifies the most frequently occurring value, making it ideal for such analyses.

Correct Answer: True

Solution:

The median is often more appropriate than the mean in situations where extreme values may skew the data, as it is not affected by outliers.

Correct Answer: True

Solution:

Ogives are graphical representations of cumulative frequency distributions.

Correct Answer: True

Solution:

The mean considers all data points, making it a comprehensive measure of central tendency.

Correct Answer: False

Solution:

Cumulative frequency distributions can be constructed for both equal and unequal class intervals.

Correct Answer: True

Solution:

For calculating the median of grouped data, it is necessary that the class intervals are continuous before applying the formula.

Correct Answer: True

Solution:

The formula for finding the median of grouped data is correctly given as Median=l+(n2−cff)×h\text{Median} = l + \left( \frac{\frac{n}{2} - cf}{f} \right) \times h, where each symbol has a specific meaning.

Correct Answer: True

Solution:

The cumulative frequency is calculated by summing the frequencies of all classes up to and including the given class.

Correct Answer: True

Solution:

The median is less affected by extreme values, making it a better measure of central tendency in such cases.

Correct Answer: True

Solution:

For calculating the median of grouped data, it is essential that the class intervals are continuous to apply the formula correctly.

Correct Answer: True

Solution:

The mode for grouped data can be found by using a formula, which takes into account the class interval and frequencies.

Correct Answer: True

Solution:

The mean for grouped data can indeed be calculated using the direct method, assumed mean method, and step deviation method, as each method provides a way to handle the data effectively.

Correct Answer: True

Solution:

The formula for calculating the median assumes that the class intervals are continuous.

Correct Answer: True

Solution:

The mode is defined as the value that appears most frequently in a dataset.

Correct Answer: True

Solution:

Extreme values can skew the mean, making it a poor representative of the data in such cases.

Correct Answer: True

Solution:

To apply the median formula for grouped data, class intervals must be continuous.

Correct Answer: False

Solution:

Cumulative frequency distributions can be of both 'less than' and 'more than' types.

Correct Answer: True

Solution:

Cumulative frequency distributions can be constructed as 'less than' or 'more than' types, depending on whether they accumulate frequencies from the lower or upper ends of the data.

Correct Answer: False

Solution:

For calculating the median, class intervals must be continuous. If they are not, adjustments need to be made.

Correct Answer: True

Solution:

The mean takes into account all the observations and lies between the extremes, making it a widely used measure of central tendency.

Correct Answer: True

Solution:

This is the correct formula for calculating the mode of grouped data, where ll is the lower limit of the modal class, fmf_m is the frequency of the modal class, f1f_1 and f2f_2 are the frequencies of the classes before and after the modal class, and hh is the class width.

Correct Answer: True

Solution:

The median is a better measure of central tendency in the presence of extreme values as it is not influenced by them.

Correct Answer: True

Solution:

Continuous class intervals are necessary for applying the median formula to grouped data.

Correct Answer: True

Solution:

The median is the value that divides the data into two equal halves.

Correct Answer: True

Solution:

The cumulative frequency of a class in a cumulative frequency distribution is indeed the sum of frequencies of all classes up to and including that class.

Correct Answer: False

Solution:

The mean is not always the best measure of central tendency, especially in the presence of extreme values. In such cases, the median might be more appropriate.

Correct Answer: True

Solution:

To find the median class, we locate the class whose cumulative frequency is greater than and nearest to half the total number of observations.

Correct Answer: False

Solution:

The mean is not always the best measure of central tendency when there are extreme values, as it can be affected by those values. In such cases, the median is often a better measure.

Correct Answer: True

Solution:

The mode can be calculated for grouped data with unequal class sizes, although it is not discussed in detail.

Correct Answer: False

Solution:

For calculating the mode of grouped data, it is necessary that the class intervals are continuous to apply the formula correctly.

Correct Answer: True

Solution:

The modal class is defined as the class interval with the highest frequency in a frequency distribution.

Correct Answer: False

Solution:

For calculating the median for grouped data, it is necessary to ensure that the class intervals are continuous before applying the formula.

Correct Answer: True

Solution:

The cumulative frequency of a class is obtained by adding the frequencies of all the classes preceding the given class.

Correct Answer: True

Solution:

The mode is used to find the most frequent or popular item in a dataset, making it the best measure for such purposes.

Correct Answer: False

Solution:

The mean is the most frequently used measure of central tendency because it considers all observations. The mode is used to find the most frequent value.

Correct Answer: True

Solution:

The cumulative frequency of a class is indeed the sum of the frequencies of all classes preceding it.

Correct Answer: True

Solution:

The median class is determined by finding the class whose cumulative frequency is greater than and nearest to half the total number of observations.

Correct Answer: True

Solution:

The mean for grouped data can indeed be calculated using three methods: the direct method, the assumed mean method, and the step deviation method.

Correct Answer: True

Solution:

The median is more appropriate than the mean when data has extreme values because it is not affected by them and provides a 'typical' observation.

Correct Answer: False

Solution:

The mean is not always a better measure of central tendency than the mode. The mode is preferred in situations requiring the most frequent value or most popular item, such as finding the most popular TV program.