What is the common threshold for statistical significance in hypothesis testing?
What does Sullivan's index measure in public health?
What is the type of sampling used when a random sample is taken from distinct groups within a population, such as religious groups like Hindus, Muslims, and Christians?
In a clinical study examining the relationship between weight and height in pediatric patients, what is the maximum possible value of the correlation coefficient if the correlation is very strong?
Which graphical representation is best suited for depicting continuous quantitative data?
Which of the following fields is primarily associated with the Hardy-Weinberg law?
What does the net reproduction rate indicate?
What is the statistical term for the value that occurs most frequently in a data set?
In epidemiological studies, which type of diagram is most effective for representing disease incidence trends over time?
In the context of public health, which statistical measure is most commonly used to assess the variability of health-related data?
Explanation: ***Correct: 0.05*** - A **p-value of 0.05 (or 5%)** is the most widely accepted and **conventional threshold** for statistical significance in most scientific fields, including medicine - This represents a **5% probability** of observing the results if the **null hypothesis** were true (Type I error or α level) - This is the **standard alpha level** taught in biostatistics and most commonly used in medical research *Incorrect: 0.01* - While 0.01 indicates **higher statistical confidence** (1% chance of Type I error), it is more stringent than the standard threshold - Used in studies requiring **greater certainty** or where false positives have severe consequences - Not the most common or default threshold in general hypothesis testing *Incorrect: 0.02* - A p-value of 0.02 represents a **2% chance of Type I error** - While statistically valid, it is **not a conventional alpha level** for most hypothesis tests - Not the standard threshold taught or applied in medical statistics *Incorrect: 0.03* - A p-value of 0.03 represents a **3% chance of Type I error** - This is **not a standard choice** for statistical significance testing - Not the conventionally prescribed alpha level in biostatistics
Explanation: ***Years of life free of disability*** - **Sullivan's index**, also known as **disability-free life expectancy (DFLE)**, directly measures the average number of years a person is expected to live in good health, free from any major disabling conditions. - It is calculated by subtracting the expected years of life lived with disability from the total life expectancy. *Total life expectancy* - This measures the average number of years an individual is expected to live, regardless of their health status, and does not specifically account for the presence of disability. - While it is a component of Sullivan's index, it is not what Sullivan's index itself measures. *Quality of life index* - This is a broader concept that incorporates various aspects of an individual's well-being, including physical health, mental health, social relationships, and environment, and is not solely focused on disability-free years. - It often involves subjective assessments of satisfaction and well-being, which differs from the objective measure of disability-free life. *Life expectancy with disability* - This measures the average number of years an individual is expected to live while experiencing some form of disability, which is the opposite of what Sullivan's index aims to quantify. - Sullivan's index subtracts these years to highlight the healthy years of life.
Explanation: ***Stratified random*** - This method involves dividing the population into **distinct, non-overlapping subgroups (strata)** based on a shared characteristic (e.g., religious groups). - A **random sample** is then drawn from each stratum, ensuring representation from all groups. *Simple random* - Involves selecting individuals entirely at **random** from the entire population, with each individual having an equal chance of being chosen. - It does not guarantee representation from specific subgroups within the population. *Systematic random* - This method selects individuals at **regular intervals** from a randomly ordered list of the population (e.g., every 10th person). - While it offers a degree of randomness, it does not specifically account for or ensure representation of distinct subgroups. *Cluster* - This method involves dividing the population into **clusters (natural groupings)**, usually geographically, and then randomly selecting entire clusters to sample. - Unlike stratified sampling, where individuals are selected from each stratum, cluster sampling involves sampling all individuals within chosen clusters.
Explanation: ***+1 (perfect positive correlation)*** - A correlation coefficient of **+1** indicates a perfect positive linear relationship between two variables, meaning as one variable increases, the other increases proportionally. - This value represents the **maximum possible strength** for a positive correlation. *0* - A correlation coefficient of **0** indicates no linear relationship between two variables. - This would contradict the premise that the correlation is "very strong". *+2 (invalid value for correlation coefficient)* - The correlation coefficient, also known as Pearson's r, can only range from **-1 to +1**. - A value of +2 is outside this possible range and is therefore an **invalid value**. *No correlation (not possible for strong correlation)* - **No correlation** implies a correlation coefficient of 0 or close to 0. - This directly contradicts the statement that there is a **very strong correlation** between weight and height.
Explanation: **Histogram** - A **histogram** is specifically designed for depicting the distribution of **continuous quantitative data** by dividing the data into bins and showing the frequency of data points within each bin. - The bars in a histogram are adjacent, indicating the continuous nature of the data and representing ranges of values. *Bar diagram* - A **bar diagram** (or bar chart) is typically used for comparing **discrete categories** or displaying changes over time for categorical data. - The bars in a bar diagram are usually separated, emphasizing distinct categories rather than continuous ranges. *Pie chart* - A **pie chart** is used to show the **proportions of a whole**, representing parts of a composition for categorical data. - It is not suitable for continuous data as it provides no information about the distribution or frequency across a range of values. *Pictogram* - A **pictogram** uses images or icons to represent data, making it visually engaging, but it is generally used for **simple comparisons of discrete or categorical data**. - It lacks the precision and detail required to accurately depict the distribution or frequency of continuous quantitative data.
Explanation: ***Population genetics*** - The **Hardy-Weinberg law** is a fundamental principle in **population genetics** that describes allele and genotype frequencies in a population. - It establishes a baseline for hypothetical populations that are not evolving, allowing for the study of deviations caused by evolutionary forces. - The equation (p² + 2pq + q² = 1) predicts genotype frequencies from allele frequencies under specific conditions. *Health economics* - **Health economics** applies economic theories to the healthcare sector, focusing on efficiency, effectiveness, and value. - This field is concerned with resource allocation, financing, and policy in health, not genetic frequencies. *Social medicine* - **Social medicine** investigates the social and environmental determinants of health and disease. - It focuses on public health, health disparities, and the societal factors influencing well-being, which is distinct from genetic population dynamics. *Epidemiology* - **Epidemiology** studies the distribution and determinants of disease in populations. - While both fields study populations, epidemiology focuses on disease patterns and risk factors, not genetic equilibrium or allele frequencies.
Explanation: ***Average number of daughters a newborn girl will have during her lifetime*** - The **net reproduction rate (NRR)** specifically measures the average number of **daughters** a newborn girl is expected to have throughout her reproductive years, taking into account **mortality** rates. - An NRR of 1 indicates that each generation of women is exactly replacing itself, while an NRR greater or less than 1 suggests population growth or decline, respectively. - This is the **correct definition** of NRR and focuses on female offspring as they are the ones who will contribute to the next generation. *Number of live births per 1000 mid-year population* - This describes the **crude birth rate (CBR)**, which is a general measure of fertility but does not account for the age and sex structure of the population or mortality rates. - It includes all live births in relation to the total population, not specifically focusing on the generational replacement of females. *Number of live births per 1000 women of child bearing age* - This definition refers to the **general fertility rate (GFR)**, which is a more refined measure of fertility than the crude birth rate, as it focuses on women in their reproductive years (typically 15-49 years). - However, it still does not track the replacement of daughters who will become mothers, nor does it factor in mortality within the female population. *None of the options* - This option is incorrect because one of the given options accurately defines the net reproduction rate. - The net reproduction rate is a well-established demographic indicator used in population studies and public health planning.
Explanation: ***Mode*** - The **mode** is the value that appears most often in a set of data. - It represents the **most frequent observation** within a dataset. *Median* - The **median** is the middle value in a dataset when the values are arranged in ascending or descending order. - It is a measure of **central tendency** that is less affected by outliers than the mean. *Standard deviation* - **Standard deviation** measures the amount of variation or dispersion of a set of values. - A low standard deviation indicates that the data points tend to be **close to the mean**. *Mean* - The **mean** is the arithmetic average of a dataset, obtained by summing all values and dividing by the number of values. - It is a common measure of **central tendency** but can be influenced by extreme values.
Explanation: ***Line graph*** - A **line graph** is ideal for visualizing **trends over time** because it connects data points sequentially, making it easy to observe increases, decreases, or stability in disease incidence. - The x-axis typically represents **time intervals** (e.g., years, months), and the y-axis represents the incidence rate, clearly showing how these values change. *Bar graph* - A **bar graph** is generally used for comparing **discrete categories** or displaying quantities for different groups, not for continuous trends over time. - While it can show incidence for different time periods, it doesn't convey the **continuity** or the overall progression as effectively as a line graph. *Scatter plot* - A **scatter plot** is primarily used to display the **relationship between two numerical variables** or to identify correlations. - It does not inherently show a **trend over time** as clearly as a line graph; instead, it shows individual data points and their distribution. *Pie chart* - A **pie chart** is used to show **proportions or percentages** of a whole, making it suitable for displaying the distribution of categories at a single point in time. - It is **not appropriate** for showing changes or trends over time, as it cannot effectively represent sequential data or temporal patterns.
Explanation: ***Standard deviation*** - The **standard deviation** is the most common measure of **variability** in public health, as it quantifies the average amount of dispersion or spread around the mean. - It is particularly useful because it is expressed in the same units as the original data, making it easy to interpret and compare differences in health outcomes. *Mean* - The **mean** is a measure of **central tendency**, representing the average value of a dataset. - While essential for understanding the typical value, it does not provide information about the **spread or variability** of the data. *Range* - The **range** is the difference between the **maximum and minimum values** in a dataset, offering a rudimentary measure of variability. - It is highly susceptible to **outliers** and does not give a comprehensive picture of data distribution, as it only considers two values. *Variance* - **Variance** measures the average of the **squared differences** from the mean, providing an indication of how far data points deviate from the average. - While closely related to standard deviation, its units are squared, making it less intuitive for direct interpretation of variability compared to the **standard deviation**.
Collection and Presentation of Data
Practice Questions
Measures of Central Tendency
Practice Questions
Measures of Dispersion
Practice Questions
Normal Distribution
Practice Questions
Sampling Methods
Practice Questions
Sample Size Calculation
Practice Questions
Hypothesis Testing
Practice Questions
Tests of Significance
Practice Questions
Correlation and Regression
Practice Questions
Survival Analysis
Practice Questions
Multivariate Analysis
Practice Questions
Statistical Software in Research
Practice Questions
Get full access to all questions, explanations, and performance tracking.
Start For Free