editions59ddd.com
59ddd.com 时间:2021-03-20 阅读:(
)
R&DConnectionsNovember2004TestingandTimeLimitsBrentBridgeman,AmandaMcBride,&WilliamMonaghanTestingandtimelimits.
It'sanalmostinevitableunion—andforgoodreason,manywouldargue.
Imposingtimelimitsontestscanservearangeofimportantfunctions.
Timelimitsareessential,forexample,ifspeedofperformanceisanintegralcomponentofwhatisbeingmeasured,aswouldbethecasewhentestingsuchskillsashowquicklysomeonecantype.
Limitingtestingtimealsohelpscontainexpensesassociatedwithtestadministrations,suchaspayinghourlyfeesforproctorsinapaper-basedadministrationorforseattimeatcomputertestingcenters.
Butlimitingtestingtimetoodrasticallycanthreatenatest'svalidity,ortheabilityofthetesttoaccuratelyreflectwhatthetestwasdesignedtomeasure.
Thisisparticularlytrueifthetestisnotintendedtomeasurehowquicklythetesttakercananswerquestionsorifthetestingtimeissolimitedthatalargenumberofexamineestakingthetestcannotcompleteit;thatis,ifthetestis"speeded.
"Speedednessintestingreferstotheeffectthattimelimitshaveontesttakers'scores.
Whenatest'stimelimitsareconstrainedtothepointthatmosttesttakersdonothaveenoughtimetoconsiderandanswereachquestion,thetestissaidtobe"speeded.
"Atestisspeededtotheextentthatthosetakingitscorelowerthantheywouldhaveiftheyhadbeengivenanunlimitedamountoftimetocompleteit.
FortestssuchastheGREandCollegeBoard'sSAT,whichareintendedtomeasureskillsrelatedtoacademicabilityratherthantherateatwhichexamineescanwork,thespeedatwhichtesttakersanswerthequestionsshouldplayaminorrole,atmost,indeterminingtestscores(Briel,O'Neill,&Scheuneman,1993;Donlon,1984).
Consequently,timelimitsforsuchtestsshouldgivemosttesttakersenoughtimetofinishthetest,andamodesttimeextensionshouldhavearelativelysmalleffectonoveralltestscores(Bridgeman,Cline,&Hessinger,2003).
Whileit'spossiblethattimelimitscanaffectthescoresofalltesttakers,somehavesuggestedthatsuchlimitsmaydifferentiallyaffectfemaleandminoritytesttakers.
Someclaimthatthe"fast-paced,orspeedednature"oftheSATputsfemaletesttakersatadisadvantageoncertaintestsectionsbecausetheyapproachproblem-solvingdifferentlythantheirmalecounterparts—femaletesttakers,theysay,aremorelikelytoworkproblemsoutcompletely,toconsidermorethanonepossibleanswer,andtochecktheirwork(Becker,1990;Linn,1992).
Othershavenotedwhatseemstobeacommonbeliefamongtesttakersandtheirfamilies(andevenamongsomeschoolcounselors)thatgivingexamineesmoretimetocompleteatestcouldsubstantiallyimprovetheirscores.
Thishasraisedconcernsoverthepossibilitythatnondisabledstudentsmayattempttoobtainextended-timeaccommodations(whichETSprovidestoexamineeswithdocumenteddisabilitiesthatrequireadditionaltestingtime,suchaslearningdisabilities,Attention-Deficit/HyperactivityDisorder,orsightproblems),andthusgainaperceivedadvantageonstandardizedtests(Bridgeman,Trapani,&Curley,2003;Mandinach,Cahalan,&Camara,2002).
Butifevidencesuggeststhatextratimedoesnotimprovetesttakerperformance,studentswouldhavelittleornomotivationtomanipulatethesystemtoreceiveextratest-takingtimethatthey'renotentitledto.
AndtherewouldbelessListening.
Learning.
Leading.
reasontoflag1thescoresofstudentswhoweregrantedextendedtime,apracticethathasengenderedfiercedebatesinceitsimplementationdecadesago.
EffectofExtraTimeonSATTestScoresWithallthisinmind,theobviousquestionsseemtobe,whathappenswhentesttakersaregivenmoretimetocompleteastandardizedtestDotesttakers'scoresimprovewhentheyaregivenmoretimeAndifso,byhowmuchTobegintoanswerthesequestions,Bridgeman,Trapani,andCurley(2003)placedSATReasoningTestsectionswithafewernumberofquestionsintothestandard30-minutevariablesectionoftwonationaltestadministrations.
Thissectiondoesnotcounttowardthefinalscoresoftesttakers,butisusedtotryoutnewquestionsandtoensurethatscoresonneweditionsofthetestarecomparabletothoseonearliereditions.
Theresearcherscreatedthereducednumbersectionsbydeletingquestionsfromaverbalsectionthatcontained35questions,toproducetwosetsofforms,onewith27questionsandanotherwith23.
Thescoresonthe23questionscouldthenbecomparedtothescoresonthesame1"Flagging"referstothepracticebywhichadministratorsofstandardizedtestsplaceasterisksorothersimilarnotationsonthescorereportsofpeoplewithdisabilitieswhotakeexamsundercertainnonstandardconditions.
Theseconditionsusuallyinvolveanaccommodationonoramodificationtothetestandmayincludeprovidingpeopletoreadthetestinstructionsandquestionsaloud,large-printandBrailleformsofthetest,individualizedadministration,orextendedtime.
Accommodationsareintendedtoeliminateirrelevantsourcesofdifficultythatarerelatedtothedisabilitybutnottotheconstructbeingassessed.
It'sworthnotingthatthenumberofstudentsrequestingextratimehasgrownbyabout26percentoverthepastfiveyears(Camara,Copeland,&Rothschild,1998).
It'salsoimportanttonotethat,asofOct.
1,2001,ETSnolongerflagsscoresofteststhatwereadministeredunderanaccommodationofextendedtime.
23questionsinthesectionscontainingthe27or35questions.
Thiswasdoneforboththemathandtheverbalsectionsofthetest.
AscanbeseeninFigures1,2,3,and4,theresearchersfoundthatallowingmoretimeperquestion(theequivalentoftime-and-a-half)hadminimalimpactonverbalscores,producinggainsoflessthan10pointsonthe200-800SATscale.
Infact,inthefirststudy,scoresforthelowerabilitygroup(thosewhoscoredbelow400)actuallydecreasedwithextratime.
TheseresultssuggestthattheSATverbalsectionisonlyslightlyspeeded.
Themathsectionappearstobemorespeededthantheverbalsection,butnothighlyspeeded:Theequivalentoftime-and-a-halfraisedscoresabout20points,althoughtheincreasewassomewhatgreater(17-26points)forhigherabilitystudents(abilitylevel>600).
Forbothsections,increasingthetimetendedtobenefithigh-scoringstudentsmorethanlower-scoringstudents,withextratimecreatingnoincreaseinscoresforstudentswithSATscoresof400orlower(abilitylevel600>60030-Items30-Items25-Items25-Items35040045050055060065070060025-Items20-Items17-ItemsAbilityGroupsStudy135040045050055060065070060025-Items20-Items17-ItemsFigure3.
Meanscoreson17M1itemswithstandardtiming(embeddedina25-itemsection),andwithtwolessspeededconditions(embeddedina20-itemsectionandasacomplete17-itemsection).
AbilityGroupsStudy235040045050055060065070060025-Items22-Items35040045050055060065070060025-Items22-ItemsFigure4.
Meanscoreson22M2itemswithstandardtiming(embeddedina25-itemsection),andwithalessspeededcondition(acomplete22-itemsection).
Source:Bridgeman,Trapani,&Curley,2003.
AbilityGroupsStudy1AbilityGroupsStudy235040045050055060065070070060035-Items27-Items23-ItemsAbilityGroupsStudy1Figure1.
Meanscoreson23V1itemswithstandardtiming(embeddedina35-itemsection),andwithtwolessspeededconditions(embeddedina27-itemsectionandasacomplete23-itemsection).
35040045050055060065035-Items27-Items23-Items410-600>60060030-Items25-ItemsAbilityGroupsStudy1650600550500450400350AbilityGroupsStudy2Page3of6600Figure2.
Meanscoreson25M1itemswithstandardtiming(embeddedina30-itemsection),andwithalessspeededconditions(acomplete25-itemsection).
25-Items30-ItemsEffectofExtraTimeonQuantitativeandVerbalGREScoresAswiththeSAT,timelimitsfortheGREGeneralTestareintendedtobesetsothatmosttesttakerscancompletethetest.
Amodesttimeextension,then,shouldhavearelativelysmalleffectontestscores.
TheresultsfromtheSATstudy,however,cannotbeappliedtothecurrentcomputer-adaptiveGREGeneralTestbecauseofthecontentandtimingdifferencesofthetwotests,andbecauseofthedifferencesbetweencomputer-adaptivetesting(CAT)andpaper-basedadministration.
InaCAT,unlikepaper-basedtests,differentexamineesreceivedifferentsetsofquestions.
2UnlikemanyCATs,theGRECAThasafixednumberofquestionsandstricttimelimitsforeachsection,althoughitisnotintendedtobeaspeededtest.
ToinvestigatespeedednessandtheGRECAT,Bridgeman,Cline,andHessinger(2003)performedastudyinwhicharesearchsectionwasaddedtotheendofregularadministrationsoftheCATGRE.
VolunteerstookeitheraverbaloraquantitativeGREsectionwitheitherstandardtimingorone-and-a-halftimesthestandardtimelimit.
Toencouragemotivatedperformance,participantswereeligibleforacashpaymentiftheydidaswellontheexperimentalsectionastheydidontheoperationalsections.
2Incomputer-adaptivetesting,thecomputerselectstherangeofquestionsthatisappropriatetoeachtesttaker'sabilitylevel.
Testtakersreceiveasetofquestionsthatmeettestdesignspecificationsandgenerallyareappropriateforeachtesttaker'sperformancelevel.
Questionsarechosenfromalargepoolofpossiblequestionscategorizedbycontentanddifficulty.
(Thecontentandtypesofquestionsaresimilartothatfoundincomparablepaper-basedtests.
)Thecomputer-adaptiveteststartswithquestionsofmoderatedifficulty.
Asthecandidateanswerseachquestion,thecomputerscoresthequestionandusesthatinformation,aswellasthecandidate'sresponsestopreviousquestions,todeterminewhichquestionispresentednext.
Aslongasthetesttakerrespondscorrectly,thecomputertypicallyselectsanextquestionofgreaterdifficulty.
Incontrast,ifthetesttakeranswersaquestionincorrectly,thecomputertypicallyselectsanextquestionoflesserdifficulty.
Subsequentquestionsarepresentedbasedinpartonthetesttaker'sperformanceonpreviousquestionsandinpartonthetestdesign.
Inotherwords,thecomputerisprogrammedtofulfillthetestdesignasitcontinuouslyadjuststofindquestionsofappropriatedifficultyfortesttakersofallperformancelevels.
AsTables1and2show,resultsfromthisstudyindicatethatextratimehadaminimaleffectonoverallscores,addingonlyabout7pointstoverbalscoresand7pointstoquantitativescoresonthe200-800scorescale.
And,aswasthecaseintheSATstudy,scoresunderthedifferentconditionswerecomparableacrossgenderandethnicgroups,althoughquantitativescoreswereslightlyhigherforlowerabilityexamineeswhohadmoretime.
Note,however,thattherearesomeimportantdifferencesbetweentheSATandGRE.
TheSATsubtractsafractionofapointforeveryquestionthatisansweredincorrectly,sothatitisbettertoleaveaquestionunansweredthantogiveanincorrectanswer.
TheGRE,ontheotherhand,hasapenaltyforleavingquestionsunansweredattheend.
QuestionsontheSATarearrangedforthemostparttobecomesuccessivelymoredifficult.
Lowerabilitytesttakersaremorelikelytoguessandgiveincorrectanswerstothelattersetofquestions,resultinginanegativeeffectontheirscores.
However,thisisnottrueforsectionswithreadingpassages,whichmakeupthemajorityoftheverbaltest.
Orderofthoseitemsisdependentuponwherethetopicstheindividualitemsrefertoappearinthepassage.
OntheGRECAT,lowerabilitytesttakerswouldreceivequestionsatorclosetotheirabilityleveltowardtheendofthetest,lesseningtheirneedtoguess.
ImpactofTimeLimitsonComputer-AdaptiveTestsAsmentionedearlier,theGRECATisnotintendedtobeaspeededtest,buthasafixednumberofquestionsandsectiontimelimits.
Sowhathappenswhentimelimitsareimposedonteststhatgivedifferentquestionstodifferentexaminees,particularlyifquestionsthataresupposedtobeequallydifficulttendtohavesubstantialdifferencesinthetimeittakestoanswerthemBridgemanandCline(2000)foundthatsomeofthequestionsintheGRE'sanalyticalandPage4of6quantitativesectionscouldbeansweredmuchmorequicklythanothers.
Theresearchersalsonotedthatwhilesomeofthisvariationinresponsetimewasrelatedtothedifficultyofthequestions—moredifficultquestionstendedtotakelongertoanswerthanlessdifficultones—therealsowassubstantialvariationinthetimerequiredtoanswerquestionsofroughlythesamedifficultylevelandmeetingthesamecontentspecifications.
Table1SampleSizes,Means,andStandardDeviationsforResearchGREQuantitativeScoresTimingconditionStatisticStandard(45min.
)Extended(68min.
)Differencen3,9043,749M6646717SD125121Table2SampleSizes,Means,andStandardDeviationsforResearchGREVerbalScoresTimingconditionStatisticStandard(30min.
)Extended(45min.
)Differencen4,1974,098M4544617SD122120Source:Bridgeman,Cline,&Hessinger,2003.
Giventhesefindings,itseemedconceivablethatexamineesreceivingtime-consumingtests(i.
e.
,thosewhogetadisproportionatenumberofitemsthattakealonger-than-averagetimetoanswer)couldbedisadvantagedand,asaresult,receivelowerscorescomparedtotesttakerswhogetalesstime-consumingtest.
Yet,uponfurtherinvestigation,BridgemanandCline(2000)couldfindnoevidenceofimpactontotaltestscores.
Inarelatedstudy,however,BridgemanandCline(2004)didfindevidencethattesttakersontheanalyticalsectionoftheGREwereindeedaffectedbythiscombinationofconditions,whichresultedintesttakershavingtoguessonthefinalquestionsinordertofinishthetestbeforerunningoutoftime.
Testtakersatthehigherabilitylevelstendedtoguessmorethanthoseatthelowerabilitylevelsbecausethequestionsadministeredtohigherabilityexamineesweretypicallymoretime-consuming.
Sinceguessingincreasesthechancesofansweringitemsincorrectly(whichwouldloweratesttaker'sscore),thesefindingsindicatethatexamineeswhoareadministeredtestswithadisproportionatenumberoftime-consumingitemsarelikelytogetlowerscoresthanthoseofcomparableabilitywhoreceivetestscontainingitemsthatcanbeansweredmorequickly.
It'sworthnotingthattheGRE'sanalyticalsectionhasbeenreplacedbytwoessaypromptsthatassessanalyticalwritingskills.
Althoughthepotentialproblemnotedabovecontributedtothisdecision,itwasnottheonlyconsideration(Bridgeman&Cline,2004).
ImplicationsThisresearchindicatesthatindividualstakingeithertheSATortheverbalandmathsectionsoftheGRECAThavesufficienttimetoanswerthequestions.
Thesetestsarenotspeededtoanysignificantdegree,andgivingtesttakersmoretimetocompletetheseitemsdoesnotresultinsignificantscoregains.
Thescoregainsthatwereachieved(lessthan10pointsfortheverbalsectionandlessthan30pointsforthemathsection,ona200-800scale)wereextremelyminorandwouldcertainlynotmakeorbreakastudent'seducationalaspirations.
Moreover,scoregainswerenotconsistentacrossabilitylevels:Fortheseassessments,high-scoringtesttakerstendedtobenefitmorethanlower-scoringstudents,withextratimecreatingnoincreaseinscoresforstudentswithSATscoresof400orlower.
Furthermore,racial/ethnicandgenderdifferenceswereneitherincreasednorreducedwithextratime,challengingargumentsthattheso-called"speeded"natureoftheSATdisadvantagesminorityandfemaletesttakers.
TheseresultsshouldhelptoreducethemotivationforstudentswhoarenotdisabledtoPage5of6manipulatethesysteminanattempttoobtainunwarrantedextended-timeaccommodations.
Atthesametime,testusersshouldnotbeoverlyconcernedthatsomestudentsmightbegaininganunfairadvantageinthismanner,sinceanysuchadvantagewouldlikelybequitesmall.
StudieswereconflictingregardingwhetherornottheAnalyticsectionoftheGRECATwasspeeded.
Althoughthemostrecentstudy(Bridgeman&Cline,2004)makeastrongargumentthatthetestwasindeedspeeded,itisnowamootpointsinceETSnolongeradministersthissection.
However,theinformationobtainedinthisstudyshouldproveusefultodevelopingfutureCATswithstricttimelimits.
ReferencesBecker,B.
J.
(1990).
ItemcharacteristicsandgenderdifferencesontheSAT-Mformathematicallyableyouths.
AmericanEducationalResearchJournal,27,65-87.
Bridgeman,B.
(2004,April).
Speedednessasathreattoconstructvalidity.
PaperpresentedattheannualmeetingoftheNationalCouncilonMeasurementinEducation,SanDiego,CA.
RetrievedOct.
19,2004,fromtheETSWebsite:http://www.
ets.
org/research/dload/NCME_2004-Bridgeman.
pdfBridgeman,B.
&Cline,F.
(2004).
Effectsofdifferentiallytime-consumingtestsoncomputer-adaptivetestscores.
JournalofEducationalMeasurement,41,137-148.
Bridgeman,B.
,&Cline,F.
(2000).
Variationsinmeanresponsetimesforquestionsonthecomputer-adaptiveGREGeneralTest:Implicationsforfairassessment(ETSRR-00-7).
RetrievedOct.
19,2004,fromtheETSWebsite:http://ftp.
ets.
org/pub/res/researcher/RR-00-07-Bridgeman.
pdfBridgeman,B.
,Cline,F.
,&Hessinger,J.
(2003).
EffectofextratimeonGREQuantitativeandVerbalscores(ETSRR-03-13).
RetrievedOct.
19,2004,fromtheETSWebsite:http://ftp.
ets.
org/pub/res/researcher/RR-03-13-Bridgeman.
pdfBridgeman,B.
,Trapani,C.
,&Curley,E.
(2003).
EffectoffewerquestionspersectiononSATIscores(CollegeBoardReportNo.
2003-2).
RetrievedOct.
19,2004,fromtheCollegeBoardWebsite:http://www.
collegeboard.
com/research/pdf/rdcbreport20032web_23502.
pdfBriel,J.
B.
,O'Neill,K.
A.
,&Scheuneman,J.
D.
(1993).
GREtechnicalmanual.
Princeton,NJ:ETS.
Camara,W.
,Copeland,T.
,&Rothschild,B.
(1998).
EffectsofextendedtimeontheSAT:Reasoningtestscoregrowthforstudentswithlearningdisabilities(CollegeBoardReportNo.
98-7).
RetrievedOct.
19,2004,fromtheCollegeBoardWebsite:http://www.
collegeboard.
com/research/pdf/rr9807_3912.
pdfDonlon,T.
F.
(Ed.
).
(1984).
TheCollegeBoardtechnicalhandbookfortheScholasticAptitudeTestandAchievementTests.
NewYork:CollegeEntranceExaminationBoard.
Linn,M.
C.
(1992).
Genderdifferencesineducationalachievement.
InSexequityeducationalopportunity,achievement,andtesting:Proceedingsofthe1991ETSInvitationalConference(pp.
11–50).
Princeton,NJ:ETS.
Mandinach,E.
,Cahalan,C.
,&Camara,W.
(2002).
Theimpactofflaggingontheadmissionprocess:Policies,practices,andimplications(CollegeBoardReportNo.
2002-2).
RetrievedOct.
19,2004,fromtheCollegeBoardWebsite:http://www.
collegeboard.
com/research/pdf/02595020604txtcvr_11433.
pdfR&DConnectionsispublishedbyETSResearch&DevelopmentEducationalTestingServiceRosedaleRoad,19-TPrinceton,NJ08541-0001SendcommentsaboutthispublicationtotheaboveaddressorviatheWebat:http://www.
ets.
org/research/contact.
htmlCopyright2004byEducationalTestingService.
Allrightsreserved.
EducationalTestingServiceisanAffirmativeAction/EqualOpportunityEmployer.
EducationalTestingService,ETS,andtheETSlogoGraduateRecordExaminations,andGREareregisteredtrademarksofEducationalTestingService.
CollegeBoardandSATareregisteredtrademarksoftheCollegeEntranceExaminationBoard.
SATReasoningTestisatrademarkoftheCollegeEntranceExaminationBoard.
Listening.
Learning.
Leading.
Page6of6
iON Cloud怎么样?iON Cloud是Krypt旗下的云服务器品牌,成立于2019年,是美国老牌机房(1998~)krypt旗下的VPS云服务器品牌,主打国外VPS云服务器业务,均采用KVM架构,整体性能配置较高,云服务器产品质量靠谱,在线率高,国内直连线路,适合建站等用途,支付宝、微信付款购买。支持Windows server 2012、2016、2019中英文版本以及主流Linux发行...
修罗云怎么样?修罗云是一家国内老牌商家,修罗云商家以销售NAT机器起家,国内的中转机相当不错,给的带宽都非常高,此前推荐的也都是国内NAT VPS机器。今天,云服务器网(www.yuntue.com)小编主要介绍一下修罗云的香港云服务器,适合建站,香港沙田cn2云服务器,2核2G,5M带宽仅70元/月起,同时香港香港大带宽NAT VPS低至50元/月起,性价比不错,可以尝试一下!点击进入:修罗云官...
今天上午有网友在群里聊到是不是有新注册域名的海外域名商家的优惠活动。如果我们并非一定要在国外注册域名的话,最近年中促销期间,国内的服务商优惠力度还是比较大的,以前我们可能较多选择海外域名商家注册域名在于海外商家便宜,如今这几年国内的商家价格也不贵的。比如在前一段时间有分享到几个商家的年中活动:1、DNSPOD域名欢购活动 - 提供域名抢购活动、DNS解析折扣、SSL证书活动2、难得再次关注新网商家...
59ddd.com为你推荐
外挂购买自动充值软件中老铁路老挝磨丁经济特区的前景如何?18comic.fun有什么好玩的网站lunwenjiancewritecheck论文检测准吗?杰景新特萨克斯吉普特500是台湾原产的吗www.622hh.comwww.710av.com怎么不可以看了5xoy.com求个如月群真汉化版下载地址partnersonline电脑内一切浏览器无法打开555sss.com不能在线播放了??555javlibrary.com大家有没有在线图书馆WWW。QUESTIA。COM的免费帐号
安徽双线服务器租用 域名查询软件 阿里云os simcentric 外贸主机 宕机监控 godaddy优惠券 免费ddos防火墙 空间服务商 一元域名 100x100头像 秒杀预告 asp免费空间申请 美国在线代理服务器 t云 如何注册阿里云邮箱 流媒体加速 闪讯官网 个人免费邮箱 xshell5注册码 更多