Survey Research: How to Create a Questionnaire Survey

Getting your Trinity Audio player ready...

Contents

What Is Survey Research?

Survey research is a systematic method of collecting data from a defined group of people, called respondents or participants, to describe, compare, or explain their knowledge, attitudes, behaviors, or outcomes. It is one of the most widely used research methods across disciplines, from public health and clinical medicine to sociology, psychology, and education.

At its core, survey research involves:

  • Defining a research question or hypothesis
  • Designing an instrument (questionnaire) to measure relevant variables
  • Administering the instrument to a sample from a target population
  • Analyzing the collected data
  • Interpreting and reporting findings

Types of Surveys

By Mode of Administration

TypeDescriptionExample Use
Self-administered (Paper)Respondents fill in a printed questionnairePatient satisfaction forms in a clinic
Online/Web-basedDelivered via platforms like SurveyMonkey, REDCap, Google FormsLarge-scale national health surveys
TelephoneInterviewer asks questions over a callPolitical polling, health behavior surveillance
Face-to-face (Structured Interview)Interviewer reads questions in personDemographic surveys in low-literacy populations
Computer-Assisted Personal Interview (CAPI)Tablet/laptop used during an in-person interviewCensus fieldwork
Mixed-modeCombination of two or more modesNational Health Interview Survey (USA)

By Research Design

DesignDescriptionExample
Cross-sectionalData collected at a single point in timePrevalence of depression among university students
Longitudinal/PanelSame respondents surveyed repeatedly over timeBritish Cohort Study tracking health outcomes
RetrospectiveAsks about past events or behaviorsLifetime tobacco use among adults
ProspectiveFollows participants forward to capture future eventsMonthly surveys tracking COVID-19 symptom onset

Advantages and Disadvantages of Survey Research

Advantages

  • Cost-effective: especially online surveys; reach large samples at minimal cost
  • Scalability: can be administered to hundreds or thousands simultaneously
  • Standardization: every participant answers the same questions, reducing interviewer bias
  • Anonymity: encourages honest responses on sensitive topics (e.g., substance use, sexual behavior)
  • Versatility: applicable across disciplines and topics
  • Quantifiability: structured responses can be easily coded and analyzed statistically
  • Wide geographical reach: online and telephone surveys cross geographical barriers

Disadvantages

  • Response bias: social desirability, acquiescence, and recall bias can distort answers
  • Low response rates: particularly for online surveys (often below 30%), threatening representativeness
  • Inflexibility: closed-ended questions cannot capture nuance or unexpected responses
  • Causality limitations: surveys establish associations, not causal relationships
  • Sampling bias: non-probability or convenience samples limit generalizability
  • Self-report limitations: participants may misremember, misunderstand, or misreport
  • Question order effects: earlier questions can prime or influence later answers

Types of Questions in a Survey

Closed-Ended Questions

  • Dichotomous: Yes / No options
    • “Have you ever been diagnosed with hypertension?”
  • Multiple choice (single answer): Select one from several options
    • “What is your highest level of education?”
  • Multiple response: Check all that apply
    • “Which of the following chronic conditions have you been diagnosed with? (Select all that apply)”
  • Rating scales: Numerical response (e.g., 1–10)
    • “Rate your current pain level from 0 (no pain) to 10 (worst possible pain).”
  • Likert scale: Agreement continuum (typically 5 or 7 points)
    • “I feel supported by my supervisor.” (Strongly Disagree → Strongly Agree)
  • Semantic differential: Bipolar adjective pairs
    • “This public health campaign was: Ineffective [1-2-3-4-5] Effective”
  • Rank-order: Respondents rank items by preference or priority

Open-Ended Questions

  • Allow free-text responses; used to capture rich, unanticipated data
  • “Please describe any barriers you face in accessing mental health services.”
  • Best used sparingly; harder to analyze at scale

Measurement Scales

ScalePropertiesExample
NominalCategories with no orderReligion, blood type, gender
OrdinalCategories with order but unequal intervalsLikert scales, education level
IntervalEqual intervals, no true zeroTemperature (°C), IQ score
RatioEqual intervals with a true zeroAge, weight, number of hospitalizations

Reliability and Validity in Survey Research

These are the two cornerstones of measurement quality. Reliability refers to consistency; validity refers to accuracy (measuring what you intend to measure).

Types of Reliability

TypeDefinitionHow to AssessExample
Test-retest reliabilityConsistency of scores over time when conditions are unchangedAdminister same survey twice; compute Pearson’s r or ICCPHQ-9 depression scale re-administered 2 weeks apart
Inter-rater reliabilityAgreement between two or more raters coding responsesCohen’s Kappa, ICCTwo researchers independently coding open-ended trauma responses
Intra-rater reliabilityConsistency of a single rater across timeCohen’s Kappa comparing ratings at two time pointsOne clinician re-scoring the same audio-recorded interviews
Internal consistencyExtent to which items in a scale measure the same constructCronbach’s alpha (α ≥ 0.70 acceptable); McDonald’s omegaAll items in an anxiety subscale should correlate with each other
Parallel forms reliabilityEquivalence between two different versions of the same testPearson’s r between Form A and Form B scoresTwo alternate versions of a medical knowledge quiz
Split-half reliabilityConsistency between two halves of a single instrumentSpearman-Brown corrected correlationOdd-numbered vs. even-numbered items on a self-esteem scale

Types of Validity

TypeDefinitionHow to AssessExample
Face validityAppears to measure the construct on the surface (subjective)Expert or respondent reviewSurvey on loneliness “looks like” it covers loneliness
Content validityItems adequately cover the full domain of the constructExpert panel review; Content Validity Index (CVI)All WHO health dimensions represented in a quality-of-life scale
Criterion validityScores predict or correlate with an external criterionPearson’s r or AUC vs. gold standardAUDIT-C alcohol screen validated against structured clinical interview
Concurrent validityCorrelation with a criterion measured at the same timeConcurrent correlationNew anxiety scale correlated with GAD-7
Predictive validityAbility to predict a future outcomeLongitudinal correlation or regressionBaseline depression score predicts rehospitalization at 6 months
Construct validityInstrument truly measures the theoretical constructFactor analysis, SEM, hypothesis testing
Convergent validityCorrelates positively with theoretically similar measuresMulti-trait multi-method (MTMM)Social support scale correlates with quality-of-life scale
Discriminant validityDoes not correlate with theoretically unrelated measuresMTMM, AVE > MSVAnxiety scale should NOT strongly correlate with cognitive ability
Structural validityFactor structure matches theoretical modelConfirmatory Factor Analysis (CFA)Single-factor structure of PHQ-9 confirmed in a new population
Cross-cultural validityInstrument performs equivalently across cultural/language groupsMeasurement invariance testing (CFA)SF-36 translated and validated in Hindi for Indian populations
Ecological validityFindings generalize to real-world settingsExpert judgment; comparison with field dataLab-based stress survey reflects real occupational stress
Internal validityExtent to which observed relationships are causal (more applicable to experimental designs)Research design controlsRuling out confounders in a pre-post survey intervention study
External validityGeneralizability of findings to other populations, settings, or timesSampling strategy, replicationNational survey results applicable to state-level population

How to Design a Questionnaire: Step-by-Step

Step 1: Define Your Research Objectives

Clearly specify what you want to measure before writing a single question.

  • Social science example: “To assess the association between perceived social support and academic burnout among postgraduate students.”
  • Biomedical example: “To measure adherence to antiretroviral therapy (ART) and identify its predictors among HIV-positive adults in a tertiary hospital.”

Step 2: Identify Your Target Population and Sample

Step 3: Decide on the Mode of Administration

Consider your population’s literacy level, internet access, sensitivity of the topic, and available resources.

Step 4: Select or Develop Your Instrument

Option A: Use a validated existing scale

Always prefer validated instruments when available. Here are some popular ones in research:

ScaleDomainUsed in
PHQ-9DepressionPrimary care, epidemiology
GAD-7Generalized anxietyMental health, clinical settings
AUDITAlcohol use disorderPublic health, addiction medicine
SF-36 / SF-12Health-related quality of lifeBiomedical, chronic disease research
Perceived Stress Scale (PSS)Psychological stressSocial and health sciences
Rosenberg Self-Esteem ScaleSelf-esteemDevelopmental and social psychology
Morisky Medication Adherence Scale (MMAS-8)Medication adherenceClinical and pharmacoepidemiology research

Option B: Develop new items

If no validated tool exists:

  • Conduct a literature review to identify the construct’s dimensions
  • Generate an item pool (3–5 items per dimension initially)
  • Conduct expert review for content validity
  • Pilot test on a small subsample
  • Analyze psychometric properties

Step 5: Write Clear, Unbiased Questions

Rules for writing good survey questions:

  • ✅ Use simple, everyday language
  • ✅ Ask one thing per question (avoid double-barreled questions)
  • ✅ Be specific about time frames (“In the past 30 days…”)
  • ✅ Provide mutually exclusive and exhaustive response options
  • ✅ Match the scale to the construct
  • ❌ Avoid leading questions (“Don’t you agree that…”)
  • ❌ Avoid loaded/emotionally charged language
  • ❌ Avoid double negatives (“Do you not disagree…”)
  • ❌ Avoid jargon or technical terms unfamiliar to respondents

Step 6: Structure the Questionnaire

A well-organized questionnaire follows this general flow:

  1. Title and brief introduction: state the purpose, confidentiality assurance, and estimated time
  2. Consent section (if applicable)
  3. Screening questions (if needed)
  4. Sociodemographic/background items: age, sex, education, occupation
  5. Core study variables: primary outcome and predictor measures
  6. Sensitive items: place later once rapport is established
  7. Open-ended items: place at the end; they require more effort
  8. Closing/thank-you statement

Step 7: Pilot Test the Questionnaire

  • Administer to a small group (typically n = 10–30) similar to your target population
  • Assess: clarity, comprehension, time to complete, missing responses
  • Revise problematic items
  • Assess internal consistency (Cronbach’s alpha) for multi-item scales

Step 8: Administer the Survey

  • Obtain ethical approval/IRB clearance if required
  • Train interviewers if using face-to-face or telephone modes
  • Track response rates; follow up with non-respondents
  • Monitor data quality in real time (for online surveys)

Step 9: Manage and Analyze Data

Examples of Survey Questions

Social Science Example: Academic Burnout Study

Background variable:

“What is your current year of postgraduate study?” ○ First year ○ Second year ○ Third year ○ Fourth year or above

Perceived Social Support (adapted from MSPSS):

“Please indicate how much you agree with each statement.” (1 = Strongly Disagree, 7 = Strongly Agree)

Item1234567
There is a special person who is around when I am in need
My family really tries to help me
I can count on my friends when things go wrong

Burnout (adapted from MBI-GS):

“How often do you experience the following? (0 = Never, 6 = Every day)”

Item0123456
I feel emotionally drained by my studies
I feel used up at the end of a study day

Biomedical Science Example: ART Adherence Study

Clinical background:

“How long have you been on antiretroviral therapy (ART)?” ○ Less than 6 months ○ 6–12 months ○ 1–3 years ○ More than 3 years

Adherence measure (adapted from MMAS-8):

“Please answer the following about your ART medications.”

ItemYesNo
Do you sometimes forget to take your HIV medicine?
Over the past two weeks, were there any days when you did not take your HIV medicine?
Have you ever cut back or stopped taking your medicine without telling your doctor because you felt worse when you took it?

Barrier assessment (open-ended):

“What are the main reasons, if any, that you have difficulty taking your HIV medications as prescribed? Please describe:”

How to Ensure Rigor in Survey Research

Sampling Rigor

  • Use probability sampling wherever feasible to ensure representativeness
  • Calculate adequate sample size a priori using power analysis (e.g., G*Power, OpenEpi)
  • Report response rates and compare respondents vs. non-respondents on key variables

Instrument Rigor

  • Use validated, culturally adapted instruments
  • Report Cronbach’s alpha or McDonald’s omega for all multi-item scales
  • Conduct confirmatory factor analysis (CFA) to test structural validity
  • For translated tools, conduct forward-backward translation and test measurement invariance

Data Collection Rigor

  • Standardize administration procedures with a protocol/manual
  • Train and certify all data collectors
  • Use double data entry or built-in logic checks (for electronic surveys)
  • Monitor for satisficing (respondents giving careless answers): embed attention checks

Reporting Rigor

  • Follow established reporting guidelines:
    • CHERRIES checklist: for online surveys
    • STROBE: for observational studies including surveys
    • COSMIN: for studies validating patient-reported outcome measures
  • Report: sampling frame, recruitment method, response rate, missing data handling, and analytic approach

Minimizing Common Biases

BiasDescriptionMitigation Strategy
Social desirability biasResponding in a socially acceptable wayAnonymity, indirect/projective questions
Acquiescence biasTendency to agree regardless of contentInclude reverse-scored items
Recall biasInaccurate memory of past eventsShorter recall periods; event anchoring
Non-response biasSystematic difference between responders and non-respondersFollow-up of non-respondents; compare on known variables
Order/context effectsEarlier items influence later responsesRandomize item/block order
Interviewer biasInterviewer behavior shapes responsesStandardized scripts; blind interviewing
Sampling biasSample not representative of populationProbability sampling; weighting

Tips for Maximizing Response Rates

  • Keep the survey as short as possible (aim for ≤15 minutes)
  • Send a pre-notification before the survey arrives
  • Use personalized invitations rather than generic mass emails
  • Include a brief, compelling rationale for participation
  • Assure confidentiality and data protection clearly
  • Offer reminders at 1 week and 2 weeks post-invitation
  • Consider incentives (monetary, vouchers, entry into a prize draw) where ethical
  • Optimize for mobile devices for online surveys
  • Use a trusted sender (institutional email rather than a commercial survey platform address)

Key Takeaways

  • Survey research is a versatile, cost-effective method for collecting data from large samples on knowledge, attitudes, behaviors, or outcomes.
  • Surveys can be cross-sectional, longitudinal, retrospective, or prospective, and administered in multiple modes (online, face-to-face, telephone).
  • Questionnaire design requires clear objectives, appropriate sampling, well-written questions, and a logical structure before any data collection begins.
  • Use validated instruments from the literature whenever available; develop new tools only when necessary, following rigorous psychometric procedures.
  • Reliability (test-retest, internal consistency, inter-rater) and validity (content, criterion, construct) are both essential and must be reported.
  • Pilot testing is non-negotiable: it catches ambiguous questions, technical glitches, and respondent burden issues.
  • Major biases (social desirability, acquiescence, recall, non-response) can be anticipated and mitigated through thoughtful design and administration.
  • Response rates should be maximized through pre-notification, reminders, brevity, and appropriate incentives.
  • Reporting should follow established checklists (CHERRIES, STROBE, COSMIN) to ensure transparency and reproducibility.
  • Sample size should be determined a priori based on power analysis, not convenience.

Frequently Asked Questions (FAQs)

What is the difference between a questionnaire and a survey?

A questionnaire is the data collection instrument: the set of questions. A survey is the broader research process that includes the questionnaire, sampling, administration, and analysis. A survey uses a questionnaire, but a questionnaire is not itself a survey.

How many response options should a Likert scale have: 5 or 7?

Both are commonly used. A 5-point scale is simpler and works well when cognitive burden is a concern or populations have lower literacy. A 7-point scale offers finer discrimination and may be more sensitive to change over time. Research generally shows 5- and 7-point scales perform similarly in terms of reliability and validity. Avoid even-numbered scales (which force a choice) unless you have a specific reason to eliminate a neutral midpoint.

When should I use skip logic (conditional branching) in a survey?

Use skip logic when certain questions are only relevant to a subset of respondents. For example: “Have you ever smoked cigarettes? (Yes/No) → If Yes: At what age did you start smoking?” Skip logic reduces respondent burden, minimizes irrelevant questions, and improves data quality. It is easily implemented in platforms like REDCap, Qualtrics, and SurveyMonkey.

What is the minimum acceptable Cronbach’s alpha for a scale?

The conventionally accepted threshold is α ≥ 0.70 for research purposes, and α ≥ 0.90 for clinical decision-making tools. However, alpha is sensitive to the number of items: longer scales artificially inflate it. Consider also reporting McDonald’s omega (ω), which is a more robust and less biased estimate of internal consistency, particularly when scale items have unequal factor loadings.

What is the difference between anonymous and confidential surveys?

  • Anonymous: No identifying information is collected at all; it is impossible to link responses to individuals.
  • Confidential: Identifying information may be collected (e.g., for follow-up), but the researcher commits to keeping it separate from responses and not disclosing it. Anonymity is stronger protection and tends to elicit more honest responses on sensitive topics. Confidentiality is necessary in longitudinal studies where participants must be tracked over time.

How do I handle missing data in survey research?

First, distinguish between missing completely at random (MCAR), missing at random (MAR), and missing not at random (MNAR): the mechanism affects which approach is appropriate. Common strategies include:

  • Listwise deletion: simple but can introduce bias and reduce power
  • Mean/median imputation: easy but underestimates variance
  • Multiple imputation (MI): preferred for MAR; generates several plausible datasets and pools results
  • Full information maximum likelihood (FIML): preferred in SEM contexts Always report the extent and pattern of missing data transparently.

Can survey research be used to establish causality?

Traditional cross-sectional surveys cannot establish causality: they can only demonstrate association. To strengthen causal inference with survey data, researchers can use: longitudinal/panel designs (establishing temporal precedence), natural experiments with survey data, or propensity score matching to control for confounders. Ultimately, experimental designs (RCTs) remain the gold standard for causal claims, but well-designed longitudinal surveys contribute substantially to causal evidence in fields like epidemiology and sociology.

What ethical considerations apply to survey research involving sensitive topics?

When surveying sensitive topics (mental health, substance use, sexuality, trauma, chronic illness):

  • Obtain informed consent clearly explaining the nature of questions before respondents begin
  • Provide trigger warnings if content may be distressing
  • Include resource signposting (e.g., helpline numbers) at the end of the survey
  • Ensure data security: encrypt data, use secure platforms (REDCap is preferred in health research)
  • Obtain IRB/ethics committee approval prior to data collection
  • Allow respondents to skip any question without penalty
  • Debrief participants after completion, especially in psychological research

Glossary of Key Terms in Survey Research

TermDefinition
Acquiescence BiasThe tendency of respondents to agree with survey statements regardless of their actual views. Mitigated by including reverse-scored items in scales.
Alpha (Cronbach’s α)A coefficient (ranging 0–1) measuring the internal consistency of a multi-item scale. Values ≥ 0.70 are generally considered acceptable for research purposes.
AnonymityA data collection condition in which no identifying information is collected, making it impossible to link responses to specific individuals.
Attrition BiasIn longitudinal surveys, the systematic dropout of participants over time, which can skew results if those who leave differ from those who remain.
Bipolar ScaleA rating scale anchored at both ends by opposing adjectives or statements (e.g., Ineffective ↔ Effective). Also called a semantic differential scale.
Branching / Skip LogicConditional routing in a questionnaire that directs respondents to different questions based on their previous answers, reducing respondent burden and irrelevant questions.
CAPIComputer-Assisted Personal Interviewing — a data collection method in which an interviewer uses a tablet or laptop to administer a structured questionnaire face-to-face.
CHERRIES ChecklistA reporting guideline specifically designed for internet-based surveys, ensuring transparency in design, sampling, response rates, and analysis.
Closed-Ended QuestionA question with a predefined set of response options (e.g., Yes/No, multiple choice, Likert scale). Easier to analyze than open-ended questions.
Cluster SamplingA probability sampling method in which the population is divided into natural groups (clusters) and whole clusters are randomly selected. Useful for geographically dispersed populations.
ConfidentialityA researcher’s commitment not to disclose identifiable participant information, even when such information has been collected.
ConstructA theoretical concept or attribute being measured (e.g., depression, social support, medication adherence).
Construct ValidityThe degree to which an instrument truly measures the theoretical construct it purports to measure. Assessed through convergent and discriminant validity, and factor analysis.
Content ValidityThe extent to which a questionnaire’s items adequately cover all dimensions of the construct being measured. Assessed by expert panel review and the Content Validity Index (CVI).
Content Validity Index (CVI)A quantitative measure of content validity, calculated as the proportion of experts rating each item as relevant. Item-CVI ≥ 0.78 and Scale-CVI ≥ 0.90 are standard thresholds.
Convergent ValidityEvidence that an instrument correlates positively and substantially with other measures of theoretically similar constructs.
COSMINCOnsensus-based Standards for the selection of health Measurement INstruments — a methodological framework and checklist for evaluating the quality of patient-reported outcome measures (PROMs).
Cross-Sectional SurveyA survey design that collects data from a sample at a single point in time; useful for estimating prevalence but cannot establish temporal relationships.
Cross-Cultural ValidityEvidence that an instrument performs equivalently across different cultural, linguistic, or demographic groups. Tested through measurement invariance analysis.
Data SaturationIn mixed-methods or qualitative survey analysis, the point at which new responses no longer yield new themes or insights.
Dichotomous QuestionA closed-ended question with only two response options (e.g., Yes / No; True / False).
Discriminant ValidityEvidence that an instrument does not correlate strongly with measures of theoretically unrelated constructs, demonstrating specificity.
Double-Barreled QuestionA flawed question that asks about two separate issues simultaneously, making it impossible to know which issue the response addresses. Example: “Do you find the course content interesting and the teaching effective?”
Ecological ValidityThe extent to which survey findings reflect real-world conditions and can be generalized to naturalistic settings outside the study context.
External ValidityThe degree to which survey findings can be generalized beyond the study sample to other populations, settings, or time periods.
Face ValidityThe superficial, subjective appearance that an instrument measures what it is intended to measure. The weakest form of validity; assessed informally by experts or respondents.
Exploratory Factor Analysis (EFA)A statistical technique used when the factor structure is unknown, to identify how items cluster into underlying dimensions.
Confirmatory Factor Analysis (CFA)A statistical technique used to test whether a hypothesized factor structure fits an observed dataset.
FIMLFull Information Maximum Likelihood — a method for handling missing data that uses all available data without imputation, producing unbiased estimates under the MAR assumption.
GAD-7The Generalized Anxiety Disorder 7-item scale; a widely validated self-report instrument for screening and measuring anxiety severity.
Gold StandardIn criterion validity testing, an established, highly accurate measure against which a new instrument is compared.
ICC (Intraclass Correlation Coefficient)A reliability statistic used to assess test-retest and inter-rater reliability, particularly for continuous data. Values > 0.75 are generally considered good.
IncentiveA reward (monetary, gift voucher, prize draw entry) offered to encourage survey participation. Must be proportionate and not coercive.
Internal ConsistencyThe degree to which all items within a scale measure the same underlying construct. Typically assessed with Cronbach’s alpha or McDonald’s omega.
Internal ValidityThe extent to which observed associations within a study reflect true relationships rather than confounding variables or methodological artifacts.
Inter-Rater ReliabilityThe degree of agreement between two or more independent raters when coding or scoring the same data. Assessed with Cohen’s Kappa or ICC.
Interval ScaleA measurement scale with equal distances between values but no true zero point (e.g., temperature in Celsius, IQ scores).
Intra-Rater ReliabilityThe consistency of a single rater’s judgments when scoring the same material on two separate occasions.
IRBInstitutional Review Board — an ethics committee that reviews and approves research involving human participants to ensure it meets ethical standards. Equivalent to a Research Ethics Committee (REC) in the UK/India.
Kappa (Cohen’s Kappa)A statistic measuring inter-rater or intra-rater agreement, corrected for chance. Values: < 0.20 = slight; 0.21–0.40 = fair; 0.41–0.60 = moderate; 0.61–0.80 = substantial; > 0.80 = almost perfect.
Leading QuestionA question phrased in a way that implies a desired answer, introducing bias. Example: “Don’t you agree that exercise is important for health?”
Likert ScaleAn ordinal rating scale — typically 5 or 7 points — measuring degree of agreement or disagreement with a statement. Named after psychologist Rensis Likert.
Longitudinal SurveyA survey design that follows the same participants over time, allowing assessment of change, trends, and temporal relationships between variables.
MCARMissing Completely at Random — a missing data pattern in which the probability of missingness is unrelated to any observed or unobserved variable. The least problematic type.
MARMissing at Random — a missing data pattern in which the probability of missingness is related to observed (but not missing) data. Amenable to imputation methods.
MNARMissing Not at Random — a missing data pattern in which the probability of missingness is related to the missing values themselves. The most problematic type.
McDonald’s Omega (ω)A reliability coefficient that, unlike Cronbach’s alpha, does not assume equal factor loadings, making it a more robust measure of internal consistency.
Measurement InvarianceA property indicating that an instrument measures the same construct in the same way across different groups. Tested via multi-group CFA.
Mixed-Mode SurveyA survey that uses two or more administration modes (e.g., online and telephone) to maximize coverage and response rates.
MMAS-8Morisky Medication Adherence Scale (8-item) — a self-report instrument widely used in clinical research to assess adherence to prescribed medications.
MSPSSMultidimensional Scale of Perceived Social Support — a 12-item validated scale measuring perceived support from family, friends, and a significant other.
Multiple Imputation (MI)A statistical method for handling missing data in which multiple plausible datasets are generated, analyzed separately, and pooled using Rubin’s rules.
Nominal ScaleThe most basic measurement level; classifies data into discrete, unordered categories with no numerical meaning (e.g., blood type, religion, ethnicity).
Non-Probability SamplingA sampling approach in which participants are not selected through random procedures, limiting generalizability. Includes convenience, purposive, snowball, and quota sampling.
Non-Response BiasA systematic distortion in results that occurs when individuals who do not respond differ meaningfully from those who do.
Open-Ended QuestionA question that allows respondents to answer in their own words, generating qualitative data. Useful for capturing nuance but harder to analyze at scale.
OperationalizationThe process of defining how an abstract construct (e.g., “depression,” “social cohesion”) will be concretely measured in a survey.
Ordinal ScaleA measurement scale in which categories have a meaningful order but intervals between them are not necessarily equal (e.g., Likert scales, education level).
Panel SurveyA longitudinal design in which the same respondents are surveyed repeatedly over time, enabling analysis of change at the individual level.
Parallel Forms ReliabilityThe consistency of scores between two different but equivalent versions of the same instrument, administered to the same group.
PHQ-9Patient Health Questionnaire-9 — a 9-item validated self-report instrument for screening and measuring depression severity, widely used in primary care and epidemiology.
Pilot TestA small-scale preliminary administration of a questionnaire to identify problems with clarity, length, response options, or technical functioning before full deployment.
PopulationThe entire group of individuals to whom a researcher wishes to generalize findings. Surveys collect data from a sample drawn from this population.
Power AnalysisA statistical procedure used to determine the minimum sample size needed to detect an effect of a given size with a specified level of confidence (typically 80% power at α = 0.05).
Predictive ValidityA subtype of criterion validity; the extent to which scores on an instrument predict a future outcome or criterion.
PROMPatient-Reported Outcome Measure — any measure of a patient’s health status reported directly by the patient, without interpretation by a clinician.
Purposive SamplingA non-probability sampling strategy in which participants are selected deliberately based on specific characteristics relevant to the research question.
QuestionnaireThe instrument — the actual set of questions — used to collect data in survey research. Distinct from the broader survey process itself.
Random SamplingA probability-based sampling method in which every member of the population has a known, non-zero chance of being selected, minimizing selection bias.
Ratio ScaleThe highest measurement level; has equal intervals and a true, meaningful zero point (e.g., age, weight, number of hospital admissions).
Recall BiasSystematic error arising from respondents’ inaccurate or incomplete memory of past events or behaviors, particularly problematic in retrospective surveys.
REDCapResearch Electronic Data Capture — a secure, web-based platform widely used in biomedical and social research for survey administration and data management.
ReliabilityThe consistency or stability of a measurement instrument across time, raters, or items. A necessary but not sufficient condition for validity.
Response RateThe proportion of sampled individuals who complete and return a survey. Calculated as: (Completed responses ÷ Eligible individuals contacted) × 100.
Reverse-Scored ItemA questionnaire item worded in the opposite direction of most items in a scale, used to detect acquiescence bias and ensure respondents read carefully.
SampleA subset of individuals drawn from a target population and actually surveyed. Findings are generalized from the sample to the population.
Sampling FrameThe complete list or database from which a sample is drawn (e.g., a hospital patient registry, university enrollment list, electoral roll).
SatisficingA response behavior in which respondents invest minimal cognitive effort, selecting the first plausible answer or consistently choosing the same response option.
ScaleA composite measure formed by combining scores from multiple related items to create a single score representing an underlying construct.
Semantic Differential ScaleA rating scale anchored by two bipolar adjectives at each end, used to measure attitudes or perceptions (e.g., Ineffective [1–2–3–4–5] Effective).
SF-36 / SF-12Short Form-36 / Short Form-12 — widely validated instruments measuring health-related quality of life across eight domains, used extensively in biomedical research.
Simple Random SamplingA probability sampling method in which every individual in the sampling frame has an equal probability of selection, typically via random number generation.
Snowball SamplingA non-probability technique in which existing participants recruit further participants from their networks; useful for hard-to-reach populations.
Social Desirability BiasThe tendency of respondents to answer in ways they believe are socially acceptable rather than truthfully.
Split-Half ReliabilityA measure of internal consistency obtained by dividing a scale into two halves and correlating their scores, corrected using the Spearman-Brown formula.
STROBESTrengthening the Reporting of OBservational studies in Epidemiology — a reporting checklist applicable to observational research including surveys.
Stratified SamplingA probability sampling method in which the population is divided into subgroups (strata) and random samples are drawn from each, ensuring representation.
Structural ValidityEvidence that the factor structure of an instrument matches its theoretical model, assessed via confirmatory factor analysis (CFA).
SurveyThe complete research process of collecting, analyzing, and interpreting data from a sample using a questionnaire. Encompasses instrument design, sampling, administration, and analysis.
Systematic SamplingA probability sampling method in which every kth individual from a sampling frame is selected (e.g., every 10th patient from a clinic register).
Target PopulationThe specific group to whom the researcher intends to generalize findings (e.g., “adults aged 18–65 living with Type 2 diabetes in urban Maharashtra”).
Test-Retest ReliabilityThe stability of scores from the same instrument administered to the same participants on two separate occasions. Assessed with Pearson’s r or ICC.
TriangulationThe use of multiple methods, data sources, or investigators to cross-validate findings and strengthen the overall rigor of a study.
ValidityThe degree to which an instrument measures what it is intended to measure. Encompasses multiple subtypes including content, criterion, and construct validity.
VariableAny characteristic or attribute that can vary across individuals or observations. Includes outcome variables (dependent) and predictor variables (independent).
WeightingA statistical adjustment applied to survey data to correct for unequal probabilities of selection or differential non-response, improving representativeness of estimates.

Terms are listed alphabetically. Definitions are contextualized for survey and questionnaire research.

Related post

Featured post

Comment

There are no comment yet.

TOP