Basic statistics for experimental medical researchers

Basic statistics for experimental medical researchers
Sample size calculations
September 15th 2016
Christian Pipper
Department of Public Health (IFSV), Faculty of Health and Medical Sciences (SUND)
E-mail: pipper@sund.ku.dk
(IFSV SUND) Basic statistics 1 / 10

Errors of statistical testing revisited
The risk of type 1 error with one test

Assume that $H_0$ is true. (This is the assumption we make to calculate the p-value!)
What is the probability of rejecting $H_0$ at a significance level $\alpha$?

[Figure: distribution of the test statistic $t_s$ under $H_0$, with the two tails each labelled "type 1 error ($\alpha/2$)"]

Conclusion: significance level $\alpha$ = the risk of type 1 error with one test.
This has nothing to do with the size and design of our data!
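The claim that the type 1 error risk does not depend on sample size can be checked by simulation. The sketch below is only an illustration under assumed settings: the seed, the number of simulated studies (n.sim), the two group sizes, and the helper reject.rate() are hypothetical choices, not part of the slides.

```r
# Simulate two-sample t-tests under H0 (both groups share the same mean)
# and record how often p < alpha for two different sample sizes.
set.seed(1)        # arbitrary seed, for reproducibility only
alpha <- 0.05
n.sim <- 2000      # number of simulated studies (illustrative choice)

# Hypothetical helper: fraction of simulated studies that reject H0
reject.rate <- function(n) {
  p <- replicate(n.sim,
                 t.test(rnorm(n), rnorm(n), var.equal = TRUE)$p.value)
  mean(p < alpha)
}

rate.small <- reject.rate(5)    # 5 observations per group
rate.large <- reject.rate(50)   # 50 observations per group
print(c(n5 = rate.small, n50 = rate.large))
```

Both rejection rates should land close to alpha, up to simulation noise, regardless of the group size.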

Errors of statistical testing revisited
The risk of type 2 error with one test

Assume that $H_0$ is false (in which case the alternative $H_A$ is true).
What is the probability of accepting $H_0$ at a significance level $\alpha$?

[Figure: distribution of $t_s$ under $H_A$, centred at $\eta$, with the areas labelled "power" and "type 2 error ($\beta$)"]

This depends crucially on the size and design of your data!
Conclusion: the power $1 - \beta$ = the probability of not committing a type 2 error.
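To see how strongly the type 2 error depends on the amount of data, one can simulate studies in which the alternative really is true. This is a rough sketch: the true mean difference (0.2), the standard deviation (0.2), the group sizes, the seed, and the helper power.sim() are all illustrative assumptions.

```r
# Simulate two-sample t-tests under HA (true mean difference = delta)
# and estimate the power as the fraction of studies with p < alpha.
set.seed(2)        # arbitrary seed, for reproducibility only
alpha <- 0.05
n.sim <- 2000      # number of simulated studies (illustrative choice)

# Hypothetical helper: simulated power for n observations per group
power.sim <- function(n, delta = 0.2, sd = 0.2) {
  p <- replicate(n.sim,
                 t.test(rnorm(n, mean = delta, sd = sd),
                        rnorm(n, mean = 0, sd = sd),
                        var.equal = TRUE)$p.value)
  mean(p < alpha)
}

pw.small <- power.sim(5)    # small study
pw.large <- power.sim(30)   # larger study
print(c(power.n5 = pw.small, power.n30 = pw.large,
        beta.n5 = 1 - pw.small, beta.n30 = 1 - pw.large))
```

With the same true effect, the small study misses it most of the time, while the larger study rarely does.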

Some intuition about the errors of statistical testing

$H_0$ is rejected: a strong statement, because $H_0$ is then very likely to be false.
  Otherwise we would be committing a type 1 error (which is not very likely).
$H_0$ is accepted: a somewhat weaker statement, because $H_0$ is true only if we are not committing a type 2 error.
  We honestly don't know how large that risk is.
  It could be a mere question of not having enough data.
Consequently, this is something we need to address in the design phase of our study.

Test statistic behaviour under the alternative

What is the distribution of the t-test statistic $t_s$ under $H_0$?
  t-distribution (small samples)
  Standard normal $N(0, 1)$ (not small samples)
What is the distribution of $t_s$ under $H_A$ (for not small samples)?
  Normal distribution with mean $\eta$ and standard deviation 1: $N(\eta, 1)$, where

$$\eta = \frac{\mu_1 - \mu_2}{\sqrt{\sigma_1^2/n_1 + \sigma_2^2/n_2}} \qquad (1)$$

Example: from a pilot study we make a qualified guess that the mean difference is $\mu_1 - \mu_2 = 0.2$ and the standard deviations are $\sigma_1 = \sigma_2 = 0.2$.
Mean of $t_s$ (insert into (1) and assume that $n_1 = n_2 = n$):

$$\eta = \frac{0.2}{\sqrt{2} \cdot 0.2 / \sqrt{n}} = \sqrt{n/2}$$
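Formula (1) is easy to evaluate directly. The following sketch wraps it in a small function (the name eta and the chosen group sizes are illustrative, not from the slides) and confirms that, for the pilot-study guess, the mean of the test statistic grows like the square root of n/2.

```r
# Mean of the test statistic under the alternative, as in formula (1)
eta <- function(delta, sd1, sd2, n1, n2) {
  delta / sqrt(sd1^2 / n1 + sd2^2 / n2)
}

# Pilot-study guess: mean difference 0.2, both standard deviations 0.2,
# equal group sizes n (the values of n are illustrative)
n <- c(5, 10, 20, 50)
eta.n <- eta(delta = 0.2, sd1 = 0.2, sd2 = 0.2, n1 = n, n2 = n)

# The last two columns should agree
print(data.frame(n = n, eta = eta.n, sqrt.n.over.2 = sqrt(n / 2)))
```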

Sample size calculation
Ingredients

Knowing the mean difference and the standard deviations, we can determine $\eta$ as a function of sample size.
For a given sample size and significance level we can thus determine the power, that is, the probability of rejecting $H_0$ under that alternative.
We can also go the other way and determine the sample size needed to obtain a given power (typically 80%).

Goal: to find $n$ so that, under the given alternative, $H_0$ is accepted with probability $1 - \text{power}$ when we evaluate the p-value at a given significance level.
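Both directions of the calculation can be sketched with the large-sample normal approximation $N(\eta, 1)$ for the test statistic. This is an approximation under stated assumptions, not the exact t-based calculation: the function power.normal(), the search grid, and the example values (mean difference 0.2, standard deviation 0.2 in both groups, equal group sizes) are illustrative.

```r
alpha  <- 0.05   # significance level
target <- 0.80   # desired power
delta  <- 0.2    # assumed mean difference
sigma  <- 0.2    # assumed common standard deviation

z.alpha <- qnorm(1 - alpha / 2)   # two-sided critical value

# Approximate power for n per group: P(|t_s| > z.alpha) when t_s ~ N(eta, 1)
power.normal <- function(n) {
  eta <- delta / sqrt(2 * sigma^2 / n)
  pnorm(-z.alpha - eta) + 1 - pnorm(z.alpha - eta)
}

# Direction 1: power for a given sample size
print(power.normal(c(10, 15, 20)))

# Direction 2: smallest n per group on the grid that reaches the target power
n.grid   <- 2:100
n.needed <- min(n.grid[power.normal(n.grid) >= target])
print(n.needed)
```

Because the approximation ignores the uncertainty in the estimated standard deviation, a calculation based on the t-distribution will typically ask for slightly more observations.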

Sample size calculation in R
Example continued

Use the function power.t.test():
  plug in the alternative in terms of mean difference (delta=0.2) and standard deviation (sd=0.2)
  plug in the power (power=0.8)
  plug in the significance level (sig.level=0.05)

R code

    > power.t.test(delta=0.2,sd=0.2,power=0.8,sig.level=0.05)

         Two-sample t test power calculation

                  n = 16.71477    <--- sample size
              delta = 0.2
                 sd = 0.2
          sig.level = 0.05
              power = 0.8
        alternative = two.sided

    NOTE: n is number in *each* group
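Since n must be a whole number of subjects, the fractional result is rounded up in practice. A minimal sketch of that step, re-using the call from the slide and then checking the power actually achieved with the rounded sample size (the variable names are illustrative):

```r
# Required sample size per group for the example on the slide
res <- power.t.test(delta = 0.2, sd = 0.2, power = 0.8, sig.level = 0.05)

# Round up to a whole number of subjects per group
n.per.group <- ceiling(res$n)

# Power achieved with the rounded-up sample size
achieved <- power.t.test(n = n.per.group, delta = 0.2, sd = 0.2,
                         sig.level = 0.05)$power

print(c(n.per.group = n.per.group, total = 2 * n.per.group))
print(achieved)
```

Rounding up rather than to the nearest integer guarantees that the achieved power is at least the requested 80%.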

Minimum detectable effect size

Goal: to find the smallest mean difference, for a given sample size and standard deviation, so that $H_0$ is accepted with probability $1 - \text{power}$ when we evaluate the p-value at a given significance level.
A feasibility calculation if you have restrictions on how large your sample size can be.

R code

    > power.t.test(n=10,sd=0.2,power=0.8,sig.level=0.05)

         Two-sample t test power calculation

                  n = 10
              delta = 0.2649891    <--- minimum detectable effect size
                 sd = 0.2
          sig.level = 0.05
              power = 0.8
        alternative = two.sided

    NOTE: n is number in *each* group
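When the feasible sample size is itself uncertain, the same call can be repeated over a few candidate group sizes. The sketch below assumes the standard deviation of 0.2 from the example; the candidate sizes in n.values are hypothetical.

```r
# Candidate group sizes (illustrative)
n.values <- c(5, 10, 20, 50)

# Minimum detectable mean difference at 80% power and 5% significance level
mde <- sapply(n.values, function(n) {
  power.t.test(n = n, sd = 0.2, power = 0.8, sig.level = 0.05)$delta
})

print(data.frame(n.per.group = n.values, min.detectable.diff = mde))
```

The minimum detectable difference shrinks as the group size grows, so a small study can only be expected to pick up large effects.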

The merits of sample size calculations

A priori choices:
  Significance level and power (chosen to control type 1 and type 2 errors)
  Specific alternative in terms of values of $\mu_1 - \mu_2$ and $\sigma$ (known from the literature or previous studies)

A note on wishful thinking:
  Often the specific alternative is based on uninformed guessing rather than hard facts.
  In such cases the sample size calculations should be used with caution.
  The best thing you can do, irrespective of any power calculation, is to sample as much data as possible.
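One way to make the caution concrete is to see how much the required sample size moves when the guessed mean difference changes. This is a sensitivity sketch only: the alternative guesses in deltas are hypothetical, and the standard deviation is held at the example value of 0.2.

```r
# Alternative guesses for the true mean difference (illustrative)
deltas <- c(0.10, 0.15, 0.20, 0.25)

# Required sample size per group (rounded up) for each guess
n.req <- sapply(deltas, function(d) {
  ceiling(power.t.test(delta = d, sd = 0.2, power = 0.8,
                       sig.level = 0.05)$n)
})

print(data.frame(assumed.delta = deltas, n.per.group = n.req))
```

If the true difference is only half of the guessed value, the study needs several times as many subjects per group, so an optimistic guess can leave a study badly underpowered.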

R-tutorial

Execute the following R code line by line and try to figure out what the code produces.

    # My 1000 t-test values (t-distributed with 50 degrees of freedom)
    my.t.test <- rt(1000, 50)
    my.t.test[1:10]

    # They are t-distributed: histogram with the t density drawn on top
    hist(my.t.test, prob = TRUE)
    x <- (-10000:10000) / 1000
    density <- dt(x, 50)
    points(y = density, x = x, type = "l")

    # My corresponding two-sided p-values
    pvals <- 2 * pt(abs(my.t.test), 50, lower.tail = FALSE)
    hist(pvals, breaks = seq(0, 1, by = 0.05))

    # Type I error: how many p-values are less than 5%? Approx. 5% of them
    length(pvals[pvals < 0.05])

    # Find sample size at a given relative effect size (mean difference/standard deviation),
    # power, and significance level
    power.t.test(delta = 0.2/0.2, power = 0.8, sig.level = 0.05)

    # Find power at a given relative effect size, sample size, and significance level
    power.t.test(n = 50, delta = 0.2/0.2, sig.level = 0.05)

    # Find the smallest detectable mean difference with a given sd, sample size, power,
    # and significance level
    power.t.test(n = 50, sd = 0.2, power = 0.8, sig.level = 0.05)