Set the Language

We weren't able to detect the audio language on your flashcards. Please select the correct language below.

Front

Back

Related Flashcards

Flashcards
»
Statistics AKT

Statistics Akt

by MarciaClark, Aug 2015

Favorite

Add to folder

Flag

Related Essays

Differences Between Experiments And Demonstrations Of A Laboratory
Presentation and analysis of data: Statistics is a useful way to collect and organize data to make future inferences and to refer back to. Through the help...
Emory Healthcare Case Study
Inferential statistics use a random sample of data taken from a population to describe and make inferences about the whole population. In order for the resea...
What Is B. F. Skinner, Positive And Negative Reinforcement?
Stevens published “On the Theory of Scales of Measurement”. This described the characteristics and applications of four types or levels of data. According to...
Null Hypothesis Case Study
On testing the null hypothesis, two main decision rules are plausible. These decisions are arrived at using either the critical value or the probability valu...
Quantitative Data Analysis
The shape the histogram takes is that the collected data from the x-variable is skewed-right. The fact that the data is skewed-right means the mean is substa...
Stephen M. Stigler's The History Of Statistics
Statistics, mathematical concepts according to The History to Statistics … are the
BUS308: Statistics For Managers
Statistics Robert Chipman BUS308: Statistics for Managers Instructor Edward Kaplan September 25, 2015 Descriptive statistics I can describe the d...
Hypergeometric And Negative Binomial Distribution
Different statistics variables such as random, discrete, and expected are helpful in the study of statistics. These different types of distributions use th...
Conformity Experiment
The median was calculated by finding the middle number from the range of data and the mode was calculated by seeing which number occurred the most often. The...
Regression Analysis In Healthcare
For instance, patterns might develop in the data. Although descriptive statistics help us digest data, they do not allow us to reach a conclusion as regards ...

Shuffle
Toggle On

Toggle Off
Alphabetize
Toggle On

Toggle Off
Front First
Toggle On

Toggle Off
Both Sides
Toggle On

Toggle Off
Read
Toggle On

Toggle Off

Reading...

Front

Card Range To Study

through

Play button

Progress

1/124

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

124 Cards in this Set

Front
Back

	Probability	Knowing how likely an event is given knowledge about pre-existing conditions.
	Descriptive Statistics	Numerical description of something quantifiable - average. Allows us to understand how representative the data set is of the population we are interested in. The larger the sample size - more useful it is.
	Inferential Statistics	The chance of an event occuring given certain parameters. Useful when impractical to have all the data for a population you are interested in. Used to generate or test hypotheses about particular populations - opinion poll. Make conclusions about the available data but those conclusions reach beyond what the data shows.
	Statistical Significance	Result is unlikely to have arisen by chance alone. Dependent on there being a real relationship between the variables being measured.
	Null Hypothesis	Hypothesis that there is no relationship between variables. Rejected if a significantly significant result is found.
	P Value	The chance of obtaining a result at least as extreme if the null hypothesis were true. < 0.05 = < 5 % chance that the result of a study would have occurred if there were no relationship between the varibales being studied.
	Type I Error	False rejection of null hypothesis when there is not a relationship between the variables being studied. Occurs if you perform +++ studies. False positive.
	Type II Error	Failing to reject the null hypothesis when it is false - not finding a relationship between the variables when there is one. False negative
	Reliability	How likely a result from a study is to be consistently replicated when repeated in a similar manner.
	Levels of Measurement	Different types of data - hierarchy within data types - some are more useful than others.
	Nominal Data	Simplest data type. Data that are a name for something. Only mathematical operation that can be performed is mode.
	Ordinal Data	Can be put into a meaningful order but the degree of difference between data points is unknown - ranking of preferences to a medication. Able to use median and mode.
	Interval Data	Fixed mathmatical difference between each point - temperature. Data doesn not have a zero point. Cannot multiply data.
	Ratio/scale Data	Definite zero point. Open to all mathematical manipulation.
	Mean	Arithmetical average. Every number is added together and then divided by how many numbers there are.
	Median	Value that falls in the middle of the data set if all the values are put in order of size. Use to describe skewed distributions.
	Mode	Most frequently occuring value in a data set. If bimodal distribution - 2 values. Used for non-numerical data.
	Range	Difference between the largest and smallest values in a data set. Dispersion of the values.
	Percentile/Centile	Number of data points divided by 100 with the data set out in order, normally from smallest to largest.
	Quartiles	25th centile 50th centile 75th centile 100th centile Interquartile range - middle 50% of the date between 25th and 75th centiles.
	Standard Deviation	Variation from the mean - larger the value the more spread out the data is.
	Normal/Gaussian Distribution	Theoretical distribution that is a bell-shaped curve that is symmetrical around the mean value of the sample or population. Median = mean = mode. 68.3 % of data will lie within 1 STD of mean. 95.4 % of data will lie within 2 STDs of mean. 99.7 % of data will lie within 3 STDs of mean.
	Skewed Distributions	Data with a longer tail to one side of the central measure when plotted. Negative skew - mode > mean and median. Positive skew - mode < mean and median.
	Pie Chart	Shows relative frequencies for a relatively small selection of categories.
	Bar Chart	The length of bars is proportional to the values they represent. Compare the frequency of variables.
	Histograms	Display continuous data such as interval and ratio data. Show the shape of the distribution of the data. Continuous data are split into ranges known as bins. The area of a bar represents the number of data points falling within that range.
	Line graphs	Show continuous data. Often used for time series data.
	Box Plots	Distribution of interval data with the central value and variability shown. Central box: - Central line = median. - Lower edge of box = lower quartile. - Upper edge of box = upper quartile. - Line to left = minimum value. - Line to right = maximum value.
	Scatter Plots	Used to plot bivariate date - two quantitative variable for each measurement - age and height of study participants.
	Correlation	Measure of the extent to which one variable changes as another variable changes. If positive - both variables change in the same direction - increase pollution = increase asthma. If negative - one variable increases as one variable decreases - increase age = decrease renal function.
	Confidence Interval	Estimate of the range of values within which the true parameter lies. Range within which the population parameter can be expected to be found in a proportion of samples - usually set at 95 %. 99 % - true population parameter is 99 % likely to fall within that range.
	Parameters	Values that describe a complete population - statistic is a value calculated from a sample.
	Parametric methods	Statistical methods used when the population we are studying fits into a normal/related distribution. Useful if we want to infer something about a population and more complicated modeling.
	Non-parametric methods	Do not assume that the population studies fits into a particular distribution - more widely applied.
	Independent variable	Variable that is manipulated to cause an effect in the dependent variable.
	Dependent variable	Variable that changes as the independent variable is manipulated.
	Chi-squared Test	Used to compare observed data with an expected outcome - ie according to the null hypothesis. Non-parametric test for nominal data. Proportions same in two groups if there is no effect of a variable. E.g. Compare smokers and non-smokers.
	Mann-Whitney U Test	Non-parametric test. Investigates whether differences in the median for 2 groups could have occurred by chance. Ordinal data. E.g. Patients using a rating scale for pain/depression are subjected to different interventions.
	Student's T-test	Tests whether the mean valuable in the dependent variable is the same for each of the two groups. Parametric test - equivalent of Mann-Whitney U Test. Paired version - tests whether the mean scores for a single group vary significantly under 2 different conditions - 2 results for each research participant - pre and post exposure.
	Spearman's rank correlation coefficient	Ordinal or interval data. Gives a correlation coefficient between -1 and +1. No correlation between the values = 0. E.g. relationship to how depression changes with exercise.
	Pearson's product-moment correlation coefficient	Parametric test - both variables are interval level data. Gives a value for how strong the relationship is between the 2 variables. Used with linear regression - measure of the linear correlation between the variables.
	Linear regression	Used when you have two interval-level variables whereby you have two readings for each research participant. E.g. Calorie intake and weight change are measures to determine relationship between increased calorie intake and weight gain. Plot the points on a scatter plot and fit with line of regression.
	Logistic regression	Used when dependent variable is nominal with 2 values - yes or no. Similar to linear regression except that the dependent variable is nominal rather than interval.
	Bias	Systematic distortion of results due to factors that have not been allowed for in designing, carrying out and reporting a study.
	Quantitative research	Understanding the experiences of individuals or small groups.
	Triangulation	Combining methods can improve out knowledge more than a single method. Using a 2nd method can help confirm or refute what has been found. Research findings are strengthened if different methods lead to similar results. Can have > 1 observer rather than 2 methods.
	Simple random sampling	Randomly select participants from a population - patients within a GP surgery. Risk that sample does not reflect the population in certain attributes.
	Stratified sampling	Sometimes desirable to have research participants with particular attributes. Splitting the population into subgroups with similar characteristics (age/gender) and taking a sample that includes members from all subgroups.
	Quota sampling	Population is split into subgroups like stratified sampling, but quotas are set in each group. Non-probability technique - not random - greater chance that some people will be selected than others.
	Cluster sampling	Large population that is split into groups and then samples are taken from within those groups - used when cost of random sampling would be an issue. Often multistage.
	Opportunity sampling	Individuals are selected through convenience without any randomization.
	Snowball sampling	Initial contact is made with an individual/group and associated of them are also asked to take part in the research Can give very misleading results - may not reflect the whole population.
	Saturation	Reached when no new data are collected over a predetermined number of interviews. Alternatively it may be reached when no more study participants can be found without taking up considerable resources.
	Epistemology	Study of what can be known in terms of which beliefs can be justified and which are simply opinion. E.g. If someone believes a bridge is safe but when crossing it, it collapses the belief is only an opinion. But if on crossing it, it holds his weight the belief is justified.
	Methodology	Methods, principles and rules that are used in undertaking research. About how epistemology and theoretical underpinnings are applied to answer a research question.
	Objectivism	Holds that there is an objective truth that can be discovered by looking at the world around us.
	Constructionism	Holds that truth is actually socially constructed.
	Pragmatism	Suggests that we reach understanding through what works at the time, with reference to the ideas of others around us.
	Ethnography	Systemic study of a social group or culture. Observation and participant observation and possibly interviews. Should be undertaken in the naturally occurring setting of the study subjects. Aim to construct the narrative from the viewpoint of the studied subject.
	Grounded Theory	Tries to avoid preconceived ideas and instead allows theories to develop from the research data. In-depth interviews, focus groups and observation groups. Useful when there is little previous exploration of a particular research setting.
	Phenomenology	Study of subjective experience. Trying to understand the different ways that people think about something. Participant observation and interviews. Explore the motivations and perceptions of the research participants and investigate them in depth.
	Action research	Aims to solve a problem. Enacting change and gauging the results of that change. Involves careful planning, observation and critical reflection. Evaluating public health interventions.
	Pragmatic research	Not a single neat qualitative research methodology. All research methods and methodologies have limitations - aims to circumvent these. Uses several methods that are applied appropriately. Needs to have a well thought out research question.
	Structured interviews	Aim to deliver data that are consistent and do not vary depending on the interviewer. Mask individual variety and do not target a depth of understanding.
	Delphi Technique	Forecasting techniques that aim to get several experts in a field to predict events.
	The Hierarchy of Evidence	1) Systemic reviews and meta-analyses 2) Randomised controlled trials 3) Cohort studies 4) Case-control studies 5) Case series and case studies 6) Expert opinion, editorials
	Randomised control trials	Gold standard for medical trials Study participants are randomly allocated to one or more interventions. Most useful - compare an intervention to current gold-standard.
	Meta-analyses or systematic reviews	Combining the results of RCTs
	Systematic reviews	Secondary research. Looks at a particular subject or question - searching for all the relevant studies with an explicit search strategy and evaluating studies using predetermined criteria. Assessing studies.
	Meta-analysis	Combining the selected studies from a systematic review by mathematical means. Increases the number of people studied. Reduces confidence interval. Each study can be given a weighting based on how reliable the study is. Forest plot.
	Forest Plot	Graphical representation of meta-analysis. Size or weighting of study. Confidence interval for the results of each trial. Overall result. Studies on left of graph. Square on horizontal line is centred on the relative risk/odds ratio of the effect. Width of horizontal line - 95 % confidence interval of relative risk/odds ratio. If line crosses the solid vertical line - no significant effect. Solid vertical line - line of no effect. Diamond - pooled result.
	Funnel Plot	Looks for publication bias in systematic reviews and meta-analysis - not all data has been published. Based on assumption that larger studies will be more powerful and precise - nearer the true value than smaller studies. Intervention effect - plotted on log relative risk on x-axis. Estimates are plotted against measure of studies precision or size - standard error. If assymtetrical - suggests that not all studies have been published.
	Heterogeneity	Measure of how different studies are.
	Homogeneity	Measure of how similar studies are.
	Cross-over Trial	Modified RCT - gives a group of participants an intervention and then after an appropriate time period gives them another intervention. Cheaper than RCT and useful when thought unethical to not give a certain treatment. Washout period - time allowed to allow the effects of the previous intervention to have disappeared.
	Risk	Probability of a particular event occurring. Can be positive - reduction in cardiovascular disease in those taking metoformin.
	Relative risk	Comparison is made to something other than zero - usually a control group. RR = EER/CER
	Absolute risk	Chance or actual risk of developing a condition with zero meaning there is no risk.
	Absolute risk reduction or increase	ARR = CER-EER. CER - control event rate - how often the event of interest occurs in the control group. EER - experimental event rate - how often that event occurs in the group given the intervention being studied. If > 0 - risk reduction. If < 0 - risk increase.
	Relative risk reduction or increase	Ratio of the probability of an outcome in the experimental group compared with the probability of the same outcome in the control or comparison group. RRR = (CER - EER) / CER RRR= ARR/CER Value of 1 = no difference between the 2 groups being studied.
	Hazard ratio	Similar to relative risk ratios but it takes into account the timing of events. Once a patient has reached the end point of a study - cure or death they are removed from the analysis. Hazard - rate at which events happen. If = 1 - relative probability in the 2 groups being equal over time. If < 1 - decreased hazard for treatment group. If > 1 - increase hazard for the treatment group.
	Number needed to treat/harm	Inverse of absolute risk reduction - 1/ARR. How many people would have to be treated to prevent/cause one outcome of interest. Assesses effectiveness of an intervention. NNT - 1/(CER-EER)
	Cohort study	Takes a group of people and sees what happens to them over a period of time - longitudinal study. Used to try and see whether suspected risk factors actually have an impact on people's health and if so what size of impact. Cohort - group of people that have something in common. Doesn't have to have a comparison group. Confounding variables may exist - 3rd variable that is not taken into account. Prospective - look forward in time.
	Retrospective cohort study	Researchers looking at the medical records of of a cohort of people to find information about variables the researchers are interested in.
	Relative risk	Normal outcome of a cohort study. What is the risk of an event occurring in the exposure group compared to the non-exposure (control) group. RR = EER/CER Relative risk reduction = 1-RR
	Case-control study	Compares people with a condition to those that don't have it. Evaluates potential causes of the condition/risk factors that contribute to the way it develops. Retrospective and rely on people correctly recalling events. Useful if condition is rare. May have spurious correlations.
	Odds ratio	Outcome of a case-control study. Looks at the odds in the case group and compares this with the odds within the control group. Odds of an event are the ratio of people with the condition to people without the condition. Ratio of events to non-events in the condition group and dividing this by the ratio of events to non-events in the control group. OR = (a/c)/(b/d) If < 1 - exposure is protective. If > 1 - exposure is causative.
	Statistical a, b, c, d table	a = exposure and condition present. b = exposure and no disease present. c = no exposure and condition present. d = no exposure and no disease present.
	Epidemiology	Study of health populations. Patterns of health and disease within a population - causative or risk factors. Effects of health and disease on populations.
	Primary prevention	Aims to prevent people from developing diseases or protect them from harm. Vaccines Weight loss/lifestyle advice
	Secondary prevention	Interventions after a disease, condition or risk factor occurs before it has caused any harm. Prescribing medication to people diagnosed with hypertension
	Tertiary prevention	Acting after the disease or condition has been diagnosed and has caused harm and symptoms. Minimise harm from an acute event, improve quality of life and prevent future complications. Treating someone following an MI with a statin, anti-platelet, beta-blocker and ACEi.
	Incidence	Number of new cases of a disease or condition in a particular population starting in a given period of time. Rate of new cases in relation to a population. Rate is given in a time period.
	Incidence Rate	Incidence rate = number of new cases starting during a specified time period / average population during specified time interval.
	Incidence Proportion	Incidence proportion = number of new cases during specified time interval / population at start of time period.
	Prevalence	Proportion of cases in a given population at a given time. For short-term illnesses - prevalence = incidence For long-term illnesses - incidence < prevalence ie less new cases than those who have the disease.
	Prevalence	Prevalence = total number of people with a condition at a given time / population
	Sensitivity	How good a test is at detecting people with the condition. High sensitivity = does not miss many cases of disease.
	Sensitivity	Sensitivity = true positive / number of people with the condition Sensitivity = TP / TP + FN
	Specificity	How good a test is at excluding those people without the condition. High specificity = low number of false positives
	Specificity	Specificity = true negatives / number of people without the condition Specificity = TN / TN + FP
	Positive Predictive Value	How likely someone who tests positive is to actually have the condition being tested for. The larger the PPV, the lower the number of false positives the test will result in. Increases as prevalence increases.
	Positive Predictive Value	PPV = true positives / true positives + false positives PPV = TP / TP + FP
	Negative Predictive Value	How likely someone who tests negative is to not have the condition being tested for. The higher the NPV the more likely someone who tests negative will not have the condition. Decreases as prevalence increases.
	Negative Predictive Value	NPV = true negatives / true negatives + false negatives NPV = TN / TN + FP
	Prior Probability	Probability before the test of someone having the disease is the same as the prevalence of that disease in the population from which the person is drawn.
	Posterior Probability	Revised probability of a person having a disease based on the predictive values and their test results.
	Likelihood Ratios	Compare the probability of a test result if the patient has the condition with the probability of the same result if the patient does not have the condition. Value > 1 test result is associated with the presence of the condition. Value < 1 test result is associated with the absence of the condition. Value = 1 - no diagnostic value of the test result
	Likelihood Ratio	Likelihood Ratio = probability of finding in patients with the disease / probability of finding in patients without disease
	Likelihood ratio for a positive result	Positive LR = sensitivity / 1 - specificity
	Likelihood ratio for a negative result	Negative LR = 1 - sensitivity / specificity
	Fagan's Nomogram	First line - pre-test probability = disease prevalence. Second line - likelihood ratio. Third line - post-test probability - calculated by drawing a straight line through first and second lines.
	Mortality rate	Number of deaths in a specified population in a specified time.
	Survival rate	Percentage of people alive at a specified time after diagnosis.
	Relative survival rate	Relative survival rate = survival rate after diagnosis / survival rate observed in a similar population without the disease.
	Cause specific survival	Removes people who die of other causes than the one of interest from the population.
	Life expectancy	Estimate of how long a newborn baby will live based on current age-specific mortality rates.
	Child Mortality Rate	Annual number of deaths in children aged between 1 and 4 years - rate per 1000 children in that age group.
	Infant Mortality rate	Children younger than 1 year of age given as a rate per 1000 liver births in that year.
	Neonatal mortality rate	Deaths in infants younger than 28 days of age.
	Case fatality rate	Number of deaths due to a particular condition, compared with the number of people who contract the condition, over the course of a disease or condition.
	Median survival rate	Time when half the patients are expected to be alive.
	Quality Adjusted Life Years	QALY - standardised measure for comparing different treatments. It is the average of additional years of life gained multiplied by a factor that takes into account the quality of life that may be expected in those additional years of life. One QALY is a year of life gained with perfect health.
	Disability Adjusted Life Years	DALY - takes the negative impact of a disability into account. One DALY = 1 lost year of healthy life.

Share This Flashcard Set