Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
124 Cards in this Set
- Front
- Back
Probability |
Knowing how likely an event is given knowledge about pre-existing conditions. |
|
Descriptive Statistics |
Numerical description of something quantifiable - average. Allows us to understand how representative the data set is of the population we are interested in. The larger the sample size - more useful it is. |
|
Inferential Statistics |
The chance of an event occuring given certain parameters. Useful when impractical to have all the data for a population you are interested in. Used to generate or test hypotheses about particular populations - opinion poll. Make conclusions about the available data but those conclusions reach beyond what the data shows. |
|
Statistical Significance |
Result is unlikely to have arisen by chance alone. Dependent on there being a real relationship between the variables being measured. |
|
Null Hypothesis |
Hypothesis that there is no relationship between variables. Rejected if a significantly significant result is found. |
|
P Value |
The chance of obtaining a result at least as extreme if the null hypothesis were true. < 0.05 = < 5 % chance that the result of a study would have occurred if there were no relationship between the varibales being studied. |
|
Type I Error |
False rejection of null hypothesis when there is not a relationship between the variables being studied. Occurs if you perform +++ studies. False positive. |
|
Type II Error |
Failing to reject the null hypothesis when it is false - not finding a relationship between the variables when there is one. False negative |
|
Reliability |
How likely a result from a study is to be consistently replicated when repeated in a similar manner. |
|
Levels of Measurement |
Different types of data - hierarchy within data types - some are more useful than others. |
|
Nominal Data |
Simplest data type. Data that are a name for something. Only mathematical operation that can be performed is mode. |
|
Ordinal Data |
Can be put into a meaningful order but the degree of difference between data points is unknown - ranking of preferences to a medication. Able to use median and mode. |
|
Interval Data |
Fixed mathmatical difference between each point - temperature. Data doesn not have a zero point. Cannot multiply data. |
|
Ratio/scale Data |
Definite zero point. Open to all mathematical manipulation. |
|
Mean |
Arithmetical average. Every number is added together and then divided by how many numbers there are. |
|
Median |
Value that falls in the middle of the data set if all the values are put in order of size. Use to describe skewed distributions. |
|
Mode |
Most frequently occuring value in a data set. If bimodal distribution - 2 values. Used for non-numerical data. |
|
Range
|
Dispersion of the values. |
|
Percentile/Centile
|
|
|
Quartiles |
50th centile 75th centile 100th centile Interquartile range - middle 50% of the date between 25th and 75th centiles. |
|
Standard Deviation |
Variation from the mean - larger the value the more spread out the data is. |
|
Normal/Gaussian Distribution |
Median = mean = mode. 68.3 % of data will lie within 1 STD of mean. 95.4 % of data will lie within 2 STDs of mean. 99.7 % of data will lie within 3 STDs of mean. |
|
Skewed Distributions
|
Negative skew - mode > mean and median. Positive skew - mode < mean and median. |
|
Pie Chart
|
Shows relative frequencies for a relatively small selection of categories. |
|
Bar Chart
|
The length of bars is proportional to the values they represent. Compare the frequency of variables. |
|
Histograms
|
Show the shape of the distribution of the data. Continuous data are split into ranges known as bins. The area of a bar represents the number of data points falling within that range. |
|
Line graphs
|
Often used for time series data. |
|
Box Plots |
Central box: - Central line = median. - Lower edge of box = lower quartile. - Upper edge of box = upper quartile. - Line to left = minimum value. - Line to right = maximum value. |
|
Scatter Plots
|
Used to plot bivariate date - two quantitative variable for each measurement - age and height of study participants. |
|
Correlation |
If positive - both variables change in the same direction - increase pollution = increase asthma. If negative - one variable increases as one variable decreases - increase age = decrease renal function. |
|
Confidence Interval
|
Range within which the population parameter can be expected to be found in a proportion of samples - usually set at 95 %. 99 % - true population parameter is 99 % likely to fall within that range. |
|
Parameters
|
|
|
Parametric methods |
Useful if we want to infer something about a population and more complicated modeling. |
|
Non-parametric methods |
Do not assume that the population studies fits into a particular distribution - more widely applied. |
|
Independent variable
|
Variable that is manipulated to cause an effect in the dependent variable. |
|
Dependent variable
|
Variable that changes as the independent variable is manipulated. |
|
Chi-squared Test |
Non-parametric test for nominal data. Proportions same in two groups if there is no effect of a variable. E.g. Compare smokers and non-smokers. |
|
Mann-Whitney U Test
|
Non-parametric test. Investigates whether differences in the median for 2 groups could have occurred by chance. Ordinal data. E.g. Patients using a rating scale for pain/depression are subjected to different interventions. |
|
Student's T-test
|
Parametric test - equivalent of Mann-Whitney U Test. Paired version - tests whether the mean scores for a single group vary significantly under 2 different conditions - 2 results for each research participant - pre and post exposure. |
|
Spearman's rank correlation coefficient |
Gives a correlation coefficient between -1 and +1. No correlation between the values = 0. E.g. relationship to how depression changes with exercise. |
|
Pearson's product-moment correlation coefficient
|
Gives a value for how strong the relationship is between the 2 variables. Used with linear regression - measure of the linear correlation between the variables. |
|
Linear regression |
E.g. Calorie intake and weight change are measures to determine relationship between increased calorie intake and weight gain. Plot the points on a scatter plot and fit with line of regression. |
|
Logistic regression |
Similar to linear regression except that the dependent variable is nominal rather than interval. |
|
Bias
|
Systematic distortion of results due to factors that have not been allowed for in designing, carrying out and reporting a study. |
|
Quantitative research
|
Understanding the experiences of individuals or small groups.
|
|
Triangulation
|
Combining methods can improve out knowledge more than a single method. Using a 2nd method can help confirm or refute what has been found. Research findings are strengthened if different methods lead to similar results. Can have > 1 observer rather than 2 methods. |
|
Simple random sampling
|
Risk that sample does not reflect the population in certain attributes. |
|
Stratified sampling
|
Sometimes desirable to have research participants with particular attributes. Splitting the population into subgroups with similar characteristics (age/gender) and taking a sample that includes members from all subgroups. |
|
Quota sampling |
Non-probability technique - not random - greater chance that some people will be selected than others. |
|
Cluster sampling
|
Often multistage. |
|
Opportunity sampling |
|
|
Snowball sampling
|
Can give very misleading results - may not reflect the whole population. |
|
Saturation |
Alternatively it may be reached when no more study participants can be found without taking up considerable resources. |
|
Epistemology |
E.g. If someone believes a bridge is safe but when crossing it, it collapses the belief is only an opinion. But if on crossing it, it holds his weight the belief is justified. |
|
Methodology
|
About how epistemology and theoretical underpinnings are applied to answer a research question. |
|
Objectivism |
Holds that there is an objective truth that can be discovered by looking at the world around us. |
|
Constructionism |
Holds that truth is actually socially constructed. |
|
Pragmatism |
Suggests that we reach understanding through what works at the time, with reference to the ideas of others around us. |
|
Ethnography |
Systemic study of a social group or culture. Observation and participant observation and possibly interviews. Should be undertaken in the naturally occurring setting of the study subjects. Aim to construct the narrative from the viewpoint of the studied subject. |
|
Grounded Theory |
In-depth interviews, focus groups and observation groups. Useful when there is little previous exploration of a particular research setting. |
|
Phenomenology
|
Trying to understand the different ways that people think about something. Participant observation and interviews. Explore the motivations and perceptions of the research participants and investigate them in depth. |
|
Action research |
Enacting change and gauging the results of that change. Involves careful planning, observation and critical reflection. Evaluating public health interventions. |
|
Pragmatic research |
Not a single neat qualitative research methodology. All research methods and methodologies have limitations - aims to circumvent these. Uses several methods that are applied appropriately. Needs to have a well thought out research question. |
|
Structured interviews
|
Aim to deliver data that are consistent and do not vary depending on the interviewer. Mask individual variety and do not target a depth of understanding. |
|
Delphi Technique
|
Forecasting techniques that aim to get several experts in a field to predict events.
|
|
The Hierarchy of Evidence |
1) Systemic reviews and meta-analyses 2) Randomised controlled trials 3) Cohort studies 4) Case-control studies 5) Case series and case studies 6) Expert opinion, editorials |
|
Randomised control trials |
Gold standard for medical trials Study participants are randomly allocated to one or more interventions. Most useful - compare an intervention to current gold-standard. |
|
Meta-analyses or systematic reviews |
Combining the results of RCTs |
|
Systematic reviews |
Secondary research. Looks at a particular subject or question - searching for all the relevant studies with an explicit search strategy and evaluating studies using predetermined criteria. Assessing studies. |
|
Meta-analysis |
Combining the selected studies from a systematic review by mathematical means. Increases the number of people studied. Reduces confidence interval. Each study can be given a weighting based on how reliable the study is. Forest plot. |
|
Forest Plot |
Graphical representation of meta-analysis. Size or weighting of study. Confidence interval for the results of each trial. Overall result. Studies on left of graph. Square on horizontal line is centred on the relative risk/odds ratio of the effect. Width of horizontal line - 95 % confidence interval of relative risk/odds ratio. If line crosses the solid vertical line - no significant effect. Solid vertical line - line of no effect. Diamond - pooled result. |
|
Funnel Plot |
Looks for publication bias in systematic reviews and meta-analysis - not all data has been published. Based on assumption that larger studies will be more powerful and precise - nearer the true value than smaller studies. Intervention effect - plotted on log relative risk on x-axis. Estimates are plotted against measure of studies precision or size - standard error. If assymtetrical - suggests that not all studies have been published. |
|
Heterogeneity |
Measure of how different studies are. |
|
Homogeneity |
Measure of how similar studies are. |
|
Cross-over Trial
|
Modified RCT - gives a group of participants an intervention and then after an appropriate time period gives them another intervention. Cheaper than RCT and useful when thought unethical to not give a certain treatment. Washout period - time allowed to allow the effects of the previous intervention to have disappeared. |
|
Risk |
Probability of a particular event occurring. Can be positive - reduction in cardiovascular disease in those taking metoformin. |
|
Relative risk
|
Comparison is made to something other than zero - usually a control group. RR = EER/CER |
|
Absolute risk |
Chance or actual risk of developing a condition with zero meaning there is no risk. |
|
Absolute risk reduction or increase |
ARR = CER-EER. CER - control event rate - how often the event of interest occurs in the control group. EER - experimental event rate - how often that event occurs in the group given the intervention being studied. If > 0 - risk reduction. If < 0 - risk increase. |
|
Relative risk reduction or increase |
Ratio of the probability of an outcome in the experimental group compared with the probability of the same outcome in the control or comparison group. RRR = (CER - EER) / CER RRR= ARR/CER Value of 1 = no difference between the 2 groups being studied. |
|
Hazard ratio
|
Similar to relative risk ratios but it takes into account the timing of events. Once a patient has reached the end point of a study - cure or death they are removed from the analysis. Hazard - rate at which events happen. If = 1 - relative probability in the 2 groups being equal over time. If < 1 - decreased hazard for treatment group. If > 1 - increase hazard for the treatment group. |
|
Number needed to treat/harm |
Inverse of absolute risk reduction - 1/ARR. How many people would have to be treated to prevent/cause one outcome of interest. Assesses effectiveness of an intervention. NNT - 1/(CER-EER) |
|
Cohort study
|
Takes a group of people and sees what happens to them over a period of time - longitudinal study. Used to try and see whether suspected risk factors actually have an impact on people's health and if so what size of impact. Cohort - group of people that have something in common. Doesn't have to have a comparison group. Confounding variables may exist - 3rd variable that is not taken into account. Prospective - look forward in time. |
|
Retrospective cohort study |
Researchers looking at the medical records of of a cohort of people to find information about variables the researchers are interested in. |
|
Relative risk
|
Normal outcome of a cohort study. What is the risk of an event occurring in the exposure group compared to the non-exposure (control) group. RR = EER/CER Relative risk reduction = 1-RR |
|
Case-control study |
Compares people with a condition to those that don't have it. Evaluates potential causes of the condition/risk factors that contribute to the way it develops. Retrospective and rely on people correctly recalling events. Useful if condition is rare. May have spurious correlations. |
|
Odds ratio |
Outcome of a case-control study. Looks at the odds in the case group and compares this with the odds within the control group. Odds of an event are the ratio of people with the condition to people without the condition. Ratio of events to non-events in the condition group and dividing this by the ratio of events to non-events in the control group. OR = (a/c)/(b/d) If < 1 - exposure is protective. If > 1 - exposure is causative. |
|
Statistical a, b, c, d table |
a = exposure and condition present. b = exposure and no disease present. c = no exposure and condition present. d = no exposure and no disease present. |
|
Epidemiology |
Study of health populations. Patterns of health and disease within a population - causative or risk factors. Effects of health and disease on populations. |
|
Primary prevention |
Aims to prevent people from developing diseases or protect them from harm. Vaccines Weight loss/lifestyle advice |
|
Secondary prevention |
Interventions after a disease, condition or risk factor occurs before it has caused any harm. Prescribing medication to people diagnosed with hypertension |
|
Tertiary prevention |
Acting after the disease or condition has been diagnosed and has caused harm and symptoms. Minimise harm from an acute event, improve quality of life and prevent future complications. Treating someone following an MI with a statin, anti-platelet, beta-blocker and ACEi. |
|
Incidence |
Number of new cases of a disease or condition in a particular population starting in a given period of time. Rate of new cases in relation to a population. Rate is given in a time period. |
|
Incidence Rate |
Incidence rate = number of new cases starting during a specified time period / average population during specified time interval. |
|
Incidence Proportion |
Incidence proportion = number of new cases during specified time interval / population at start of time period. |
|
Prevalence |
Proportion of cases in a given population at a given time. For short-term illnesses - prevalence = incidence For long-term illnesses - incidence < prevalence ie less new cases than those who have the disease. |
|
Prevalence
|
Prevalence = total number of people with a condition at a given time / population |
|
Sensitivity
|
How good a test is at detecting people with the condition. High sensitivity = does not miss many cases of disease. |
|
Sensitivity |
Sensitivity = true positive / number of people with the condition Sensitivity = TP / TP + FN |
|
Specificity |
How good a test is at excluding those people without the condition. High specificity = low number of false positives |
|
Specificity |
Specificity = true negatives / number of people without the condition Specificity = TN / TN + FP |
|
Positive Predictive Value |
How likely someone who tests positive is to actually have the condition being tested for. The larger the PPV, the lower the number of false positives the test will result in. Increases as prevalence increases. |
|
Positive Predictive Value |
PPV = true positives / true positives + false positives PPV = TP / TP + FP |
|
Negative Predictive Value |
How likely someone who tests negative is to not have the condition being tested for. The higher the NPV the more likely someone who tests negative will not have the condition. Decreases as prevalence increases. |
|
Negative Predictive Value |
NPV = true negatives / true negatives + false negatives NPV = TN / TN + FP |
|
Prior Probability |
Probability before the test of someone having the disease is the same as the prevalence of that disease in the population from which the person is drawn. |
|
Posterior Probability
|
Revised probability of a person having a disease based on the predictive values and their test results. |
|
Likelihood Ratios
|
Compare the probability of a test result if the patient has the condition with the probability of the same result if the patient does not have the condition. Value > 1 test result is associated with the presence of the condition. Value < 1 test result is associated with the absence of the condition. Value = 1 - no diagnostic value of the test result |
|
Likelihood Ratio |
Likelihood Ratio = probability of finding in patients with the disease / probability of finding in patients without disease |
|
Likelihood ratio for a positive result |
Positive LR = sensitivity / 1 - specificity |
|
Likelihood ratio for a negative result |
Negative LR = 1 - sensitivity / specificity |
|
Fagan's Nomogram |
First line - pre-test probability = disease prevalence. Second line - likelihood ratio. Third line - post-test probability - calculated by drawing a straight line through first and second lines. |
|
Mortality rate |
Number of deaths in a specified population in a specified time. |
|
Survival rate |
Percentage of people alive at a specified time after diagnosis. |
|
Relative survival rate |
Relative survival rate = survival rate after diagnosis / survival rate observed in a similar population without the disease. |
|
Cause specific survival |
Removes people who die of other causes than the one of interest from the population. |
|
Life expectancy |
Estimate of how long a newborn baby will live based on current age-specific mortality rates. |
|
Child Mortality Rate |
Annual number of deaths in children aged between 1 and 4 years - rate per 1000 children in that age group. |
|
Infant Mortality rate |
Children younger than 1 year of age given as a rate per 1000 liver births in that year. |
|
Neonatal mortality rate |
Deaths in infants younger than 28 days of age. |
|
Case fatality rate
|
Number of deaths due to a particular condition, compared with the number of people who contract the condition, over the course of a disease or condition. |
|
Median survival rate |
Time when half the patients are expected to be alive. |
|
Quality Adjusted Life Years |
QALY - standardised measure for comparing different treatments. It is the average of additional years of life gained multiplied by a factor that takes into account the quality of life that may be expected in those additional years of life. One QALY is a year of life gained with perfect health. |
|
Disability Adjusted Life Years |
DALY - takes the negative impact of a disability into account. One DALY = 1 lost year of healthy life. |