Learning Objectives
By the end of this chapter, you will be able to:
- Distinguish between descriptive and inferential statistics
- Define population, sample, parameter, and statistic
- Identify when to use descriptive vs inferential methods
- Understand the relationship between samples and populations
The Two Branches of Statistics
Statistics as a science can be divided into two main branches:
flowchart TB
A[STATISTICS] --> B[Descriptive Statistics]
A --> C[Inferential Statistics]
B --> B1["Summarizes data"]
B --> B2["Describes characteristics"]
B --> B3["Uses tables, charts, measures"]
B --> B4["No generalizations beyond data"]
C --> C1["Makes inferences"]
C --> C2["Generalizes to population"]
C --> C3["Uses probability theory"]
C --> C4["Tests hypotheses"]
style B fill:#e1f5fe
style C fill:#fff3e0
Descriptive Statistics
Descriptive statistics consists of methods for organizing, displaying, and summarizing data using tables, graphs, and numerical measures.
Purpose of Descriptive Statistics
- Organize raw data into meaningful form
- Summarize large datasets using key measures
- Present data in tables and charts
- Describe the basic features of the data
Key Components
mindmap
root((Descriptive Statistics))
Organizing Data
Frequency Tables
Cross-tabulations
Graphical Display
Bar Charts
Histograms
Pie Charts
Line Graphs
Numerical Measures
Central Tendency
Mean
Median
Mode
Dispersion
Range
Variance
Standard Deviation
Example: Descriptive Statistics in Public Administration
Scenario: A district administration collects data on 500 households regarding their access to public services.
| Service | Households with Access | Percentage |
|---|---|---|
| Drinking Water | 425 | 85% |
| Electricity | 475 | 95% |
| Health Post | 350 | 70% |
| Primary School | 460 | 92% |
| Road Access | 380 | 76% |
Descriptive Summary:
- Average access across services: 83.6%
- Most accessible service: Electricity (95%)
- Least accessible service: Health Post (70%)
Key Point: Descriptive statistics only describes THIS dataset. It does not make claims about other districts or the entire country.
Inferential Statistics
Inferential statistics consists of methods that use sample data to make generalizations, predictions, or decisions about a larger population.
Purpose of Inferential Statistics
- Estimate population characteristics from samples
- Test hypotheses about populations
- Make predictions based on data
- Quantify uncertainty in conclusions
The Logic of Inference
flowchart LR
A[Population<br/>N = Large/Unknown] -->|Select Sample| B[Sample<br/>n = Small/Known]
B -->|Compute| C[Sample Statistics<br/>x̄, s, p]
C -->|Estimate| D[Population Parameters<br/>μ, σ, P]
D -->|Decision Making| E[Conclusions about<br/>Population]
style A fill:#ffcdd2
style B fill:#c8e6c9
style E fill:#b3e5fc
Example: Inferential Statistics in Public Administration
Scenario: A government wants to know the average satisfaction level of all public service users in Nepal (population = 30 million citizens).
Process:
- Population: All citizens using public services (too large to survey all)
- Sample: Randomly select 2,000 citizens from different regions
- Measure: Average satisfaction score in sample = 6.8 out of 10
- Inference: “We estimate that the average satisfaction level of ALL citizens is between 6.5 and 7.1 (with 95% confidence)”
Key Point: Inferential statistics allows us to make statements about populations based on samples, with a known degree of uncertainty.
Key Terminology
Understanding these terms is essential for the entire course:
Population vs Sample
| Concept | Definition | Symbol | Example |
|---|---|---|---|
| Population | Complete set of all items of interest | $N$ | All government employees in Nepal |
| Sample | Subset of the population selected for study | $n$ | 500 randomly selected employees |
Parameter vs Statistic
| Concept | Definition | Notation | Example |
|---|---|---|---|
| Parameter | Numerical measure describing a population | Greek letters ($\mu$, $\sigma$, $P$) | True average salary of ALL employees |
| Statistic | Numerical measure computed from a sample | Roman letters ($\bar{x}$, $s$, $p$) | Average salary of 500 sampled employees |
flowchart TB
subgraph "Population"
A[Parameter μ = Unknown<br/>True population mean]
end
subgraph "Sample"
B[Statistic x̄ = Known<br/>Calculated sample mean]
end
A -.->|"Sampling"| B
B -->|"Inference"| A
style A fill:#ffecb3
style B fill:#c8e6c9
Common Notation
| Measure | Population Parameter | Sample Statistic |
|---|---|---|
| Mean | $\mu$ (mu) | $\bar{x}$ (x-bar) |
| Standard Deviation | $\sigma$ (sigma) | $s$ |
| Proportion | $P$ | $p$ |
| Size | $N$ | $n$ |
| Variance | $\sigma^2$ | $s^2$ |
Comparison: Descriptive vs Inferential Statistics
| Aspect | Descriptive Statistics | Inferential Statistics |
|---|---|---|
| Purpose | Describe and summarize data | Make generalizations about population |
| Scope | Limited to collected data | Extends beyond collected data |
| Methods | Tables, charts, measures | Estimation, hypothesis testing |
| Uncertainty | No uncertainty (describes what IS) | Involves probability of error |
| Population/Sample | Can be used for either | Specifically uses sample to infer about population |
| Output | Exact values | Estimates with confidence levels |
Visual Comparison
flowchart TB
subgraph Descriptive["Descriptive Statistics"]
D1[Collected Data] --> D2[Summary Measures]
D2 --> D3["Result: 'The average<br/>in this sample is 75'"]
end
subgraph Inferential["Inferential Statistics"]
I1[Sample Data] --> I2[Statistical Analysis]
I2 --> I3["Result: 'The population<br/>average is likely<br/>between 72 and 78'"]
end
style D3 fill:#e8f5e9
style I3 fill:#fff3e0
When to Use Each Type
Use Descriptive Statistics When:
✅ You have data from an entire population (census) ✅ You only need to summarize the data at hand ✅ You’re creating reports or dashboards ✅ You’re doing exploratory data analysis
Example: Annual report showing department-wise budget expenditure for the fiscal year.
Use Inferential Statistics When:
✅ You cannot collect data from the entire population ✅ You need to make predictions about a population ✅ You want to test a hypothesis ✅ You need to generalize findings
Example: Using a sample survey to estimate citizen satisfaction across all municipalities.
Practical Application in Public Administration
Case Study: Employee Performance Analysis
Situation: The Public Service Commission wants to understand employee performance patterns.
Descriptive Approach:
- Collect performance scores of ALL 450,000 government employees
- Calculate: Mean = 72.5, Median = 74, Mode = 75
- Create: Distribution chart showing performance categories
- Conclusion: “The average performance score of government employees is 72.5”
Inferential Approach:
- Randomly sample 5,000 employees
- Calculate: Sample mean = 71.8, Standard deviation = 12.3
- Estimate: Population mean is between 71.4 and 72.2 (95% confidence)
- Conclusion: “We are 95% confident that the true average performance score of ALL government employees is between 71.4 and 72.2”
Key Insight
| Descriptive | Inferential |
|---|---|
| Tells us what the data shows | Tells us what the data suggests about the larger picture |
| Certain but limited | Uncertain but generalizable |
The Statistical Process
Understanding how descriptive and inferential statistics fit into the overall research process:
flowchart TB
A[Research Question] --> B[Define Population]
B --> C{Can we study<br/>entire population?}
C -->|Yes| D[Census/Complete Data]
C -->|No| E[Select Sample]
D --> F[Descriptive Statistics]
E --> F
F --> G{Need to generalize?}
G -->|No| H[Report Descriptive<br/>Summary]
G -->|Yes| I[Inferential Statistics]
I --> J[Draw Conclusions<br/>about Population]
style F fill:#e1f5fe
style I fill:#fff3e0
Practice Questions
Short Answer Questions
-
Define descriptive and inferential statistics with examples from public administration.
- Explain the relationship between:
- a) Population and Sample
- b) Parameter and Statistic
-
A researcher surveys 1,000 households and reports: “The average household size is 4.2 members.” Is this descriptive or inferential statistics? Explain.
- If a sample mean is $\bar{x} = 85$ and we estimate the population mean to be between 82 and 88, which statistics (descriptive or inferential) are being used?
Multiple Choice Questions
Q1. Which of the following is a population parameter?
- a) Sample mean $\bar{x}$
- b) Population mean $\mu$ ✓
- c) Sample size $n$
- d) Sample standard deviation $s$
Q2. Descriptive statistics is used to:
- a) Make predictions about populations
- b) Test hypotheses
- c) Summarize and describe data ✓
- d) Estimate population parameters
Q3. Using sample data to make generalizations about a population is called:
- a) Descriptive statistics
- b) Inferential statistics ✓
- c) Probability theory
- d) Data collection
Q4. A census is best analyzed using:
- a) Only inferential statistics
- b) Only descriptive statistics ✓
- c) Neither type
- d) Must use both types
Summary Table
| Concept | Definition | Key Feature |
|---|---|---|
| Descriptive Statistics | Summarizes data | No generalization beyond data |
| Inferential Statistics | Makes inferences about population | Uses probability theory |
| Population | Complete set of interest | Often too large to study entirely |
| Sample | Subset of population | Used when population is inaccessible |
| Parameter | Population measure | Usually unknown |
| Statistic | Sample measure | Calculated from sample data |
Next Chapter
In the next chapter, we will study Measures of Central Tendency - methods to find the “center” or typical value in a dataset, including mean, weighted mean, median, and mode.

