editions59ddd.com
59ddd.com 时间:2021-03-20 阅读:(
)
R&DConnectionsNovember2004TestingandTimeLimitsBrentBridgeman,AmandaMcBride,&WilliamMonaghanTestingandtimelimits.
It'sanalmostinevitableunion—andforgoodreason,manywouldargue.
Imposingtimelimitsontestscanservearangeofimportantfunctions.
Timelimitsareessential,forexample,ifspeedofperformanceisanintegralcomponentofwhatisbeingmeasured,aswouldbethecasewhentestingsuchskillsashowquicklysomeonecantype.
Limitingtestingtimealsohelpscontainexpensesassociatedwithtestadministrations,suchaspayinghourlyfeesforproctorsinapaper-basedadministrationorforseattimeatcomputertestingcenters.
Butlimitingtestingtimetoodrasticallycanthreatenatest'svalidity,ortheabilityofthetesttoaccuratelyreflectwhatthetestwasdesignedtomeasure.
Thisisparticularlytrueifthetestisnotintendedtomeasurehowquicklythetesttakercananswerquestionsorifthetestingtimeissolimitedthatalargenumberofexamineestakingthetestcannotcompleteit;thatis,ifthetestis"speeded.
"Speedednessintestingreferstotheeffectthattimelimitshaveontesttakers'scores.
Whenatest'stimelimitsareconstrainedtothepointthatmosttesttakersdonothaveenoughtimetoconsiderandanswereachquestion,thetestissaidtobe"speeded.
"Atestisspeededtotheextentthatthosetakingitscorelowerthantheywouldhaveiftheyhadbeengivenanunlimitedamountoftimetocompleteit.
FortestssuchastheGREandCollegeBoard'sSAT,whichareintendedtomeasureskillsrelatedtoacademicabilityratherthantherateatwhichexamineescanwork,thespeedatwhichtesttakersanswerthequestionsshouldplayaminorrole,atmost,indeterminingtestscores(Briel,O'Neill,&Scheuneman,1993;Donlon,1984).
Consequently,timelimitsforsuchtestsshouldgivemosttesttakersenoughtimetofinishthetest,andamodesttimeextensionshouldhavearelativelysmalleffectonoveralltestscores(Bridgeman,Cline,&Hessinger,2003).
Whileit'spossiblethattimelimitscanaffectthescoresofalltesttakers,somehavesuggestedthatsuchlimitsmaydifferentiallyaffectfemaleandminoritytesttakers.
Someclaimthatthe"fast-paced,orspeedednature"oftheSATputsfemaletesttakersatadisadvantageoncertaintestsectionsbecausetheyapproachproblem-solvingdifferentlythantheirmalecounterparts—femaletesttakers,theysay,aremorelikelytoworkproblemsoutcompletely,toconsidermorethanonepossibleanswer,andtochecktheirwork(Becker,1990;Linn,1992).
Othershavenotedwhatseemstobeacommonbeliefamongtesttakersandtheirfamilies(andevenamongsomeschoolcounselors)thatgivingexamineesmoretimetocompleteatestcouldsubstantiallyimprovetheirscores.
Thishasraisedconcernsoverthepossibilitythatnondisabledstudentsmayattempttoobtainextended-timeaccommodations(whichETSprovidestoexamineeswithdocumenteddisabilitiesthatrequireadditionaltestingtime,suchaslearningdisabilities,Attention-Deficit/HyperactivityDisorder,orsightproblems),andthusgainaperceivedadvantageonstandardizedtests(Bridgeman,Trapani,&Curley,2003;Mandinach,Cahalan,&Camara,2002).
Butifevidencesuggeststhatextratimedoesnotimprovetesttakerperformance,studentswouldhavelittleornomotivationtomanipulatethesystemtoreceiveextratest-takingtimethatthey'renotentitledto.
AndtherewouldbelessListening.
Learning.
Leading.
reasontoflag1thescoresofstudentswhoweregrantedextendedtime,apracticethathasengenderedfiercedebatesinceitsimplementationdecadesago.
EffectofExtraTimeonSATTestScoresWithallthisinmind,theobviousquestionsseemtobe,whathappenswhentesttakersaregivenmoretimetocompleteastandardizedtestDotesttakers'scoresimprovewhentheyaregivenmoretimeAndifso,byhowmuchTobegintoanswerthesequestions,Bridgeman,Trapani,andCurley(2003)placedSATReasoningTestsectionswithafewernumberofquestionsintothestandard30-minutevariablesectionoftwonationaltestadministrations.
Thissectiondoesnotcounttowardthefinalscoresoftesttakers,butisusedtotryoutnewquestionsandtoensurethatscoresonneweditionsofthetestarecomparabletothoseonearliereditions.
Theresearcherscreatedthereducednumbersectionsbydeletingquestionsfromaverbalsectionthatcontained35questions,toproducetwosetsofforms,onewith27questionsandanotherwith23.
Thescoresonthe23questionscouldthenbecomparedtothescoresonthesame1"Flagging"referstothepracticebywhichadministratorsofstandardizedtestsplaceasterisksorothersimilarnotationsonthescorereportsofpeoplewithdisabilitieswhotakeexamsundercertainnonstandardconditions.
Theseconditionsusuallyinvolveanaccommodationonoramodificationtothetestandmayincludeprovidingpeopletoreadthetestinstructionsandquestionsaloud,large-printandBrailleformsofthetest,individualizedadministration,orextendedtime.
Accommodationsareintendedtoeliminateirrelevantsourcesofdifficultythatarerelatedtothedisabilitybutnottotheconstructbeingassessed.
It'sworthnotingthatthenumberofstudentsrequestingextratimehasgrownbyabout26percentoverthepastfiveyears(Camara,Copeland,&Rothschild,1998).
It'salsoimportanttonotethat,asofOct.
1,2001,ETSnolongerflagsscoresofteststhatwereadministeredunderanaccommodationofextendedtime.
23questionsinthesectionscontainingthe27or35questions.
Thiswasdoneforboththemathandtheverbalsectionsofthetest.
AscanbeseeninFigures1,2,3,and4,theresearchersfoundthatallowingmoretimeperquestion(theequivalentoftime-and-a-half)hadminimalimpactonverbalscores,producinggainsoflessthan10pointsonthe200-800SATscale.
Infact,inthefirststudy,scoresforthelowerabilitygroup(thosewhoscoredbelow400)actuallydecreasedwithextratime.
TheseresultssuggestthattheSATverbalsectionisonlyslightlyspeeded.
Themathsectionappearstobemorespeededthantheverbalsection,butnothighlyspeeded:Theequivalentoftime-and-a-halfraisedscoresabout20points,althoughtheincreasewassomewhatgreater(17-26points)forhigherabilitystudents(abilitylevel>600).
Forbothsections,increasingthetimetendedtobenefithigh-scoringstudentsmorethanlower-scoringstudents,withextratimecreatingnoincreaseinscoresforstudentswithSATscoresof400orlower(abilitylevel600>60030-Items30-Items25-Items25-Items35040045050055060065070060025-Items20-Items17-ItemsAbilityGroupsStudy135040045050055060065070060025-Items20-Items17-ItemsFigure3.
Meanscoreson17M1itemswithstandardtiming(embeddedina25-itemsection),andwithtwolessspeededconditions(embeddedina20-itemsectionandasacomplete17-itemsection).
AbilityGroupsStudy235040045050055060065070060025-Items22-Items35040045050055060065070060025-Items22-ItemsFigure4.
Meanscoreson22M2itemswithstandardtiming(embeddedina25-itemsection),andwithalessspeededcondition(acomplete22-itemsection).
Source:Bridgeman,Trapani,&Curley,2003.
AbilityGroupsStudy1AbilityGroupsStudy235040045050055060065070070060035-Items27-Items23-ItemsAbilityGroupsStudy1Figure1.
Meanscoreson23V1itemswithstandardtiming(embeddedina35-itemsection),andwithtwolessspeededconditions(embeddedina27-itemsectionandasacomplete23-itemsection).
35040045050055060065035-Items27-Items23-Items410-600>60060030-Items25-ItemsAbilityGroupsStudy1650600550500450400350AbilityGroupsStudy2Page3of6600Figure2.
Meanscoreson25M1itemswithstandardtiming(embeddedina30-itemsection),andwithalessspeededconditions(acomplete25-itemsection).
25-Items30-ItemsEffectofExtraTimeonQuantitativeandVerbalGREScoresAswiththeSAT,timelimitsfortheGREGeneralTestareintendedtobesetsothatmosttesttakerscancompletethetest.
Amodesttimeextension,then,shouldhavearelativelysmalleffectontestscores.
TheresultsfromtheSATstudy,however,cannotbeappliedtothecurrentcomputer-adaptiveGREGeneralTestbecauseofthecontentandtimingdifferencesofthetwotests,andbecauseofthedifferencesbetweencomputer-adaptivetesting(CAT)andpaper-basedadministration.
InaCAT,unlikepaper-basedtests,differentexamineesreceivedifferentsetsofquestions.
2UnlikemanyCATs,theGRECAThasafixednumberofquestionsandstricttimelimitsforeachsection,althoughitisnotintendedtobeaspeededtest.
ToinvestigatespeedednessandtheGRECAT,Bridgeman,Cline,andHessinger(2003)performedastudyinwhicharesearchsectionwasaddedtotheendofregularadministrationsoftheCATGRE.
VolunteerstookeitheraverbaloraquantitativeGREsectionwitheitherstandardtimingorone-and-a-halftimesthestandardtimelimit.
Toencouragemotivatedperformance,participantswereeligibleforacashpaymentiftheydidaswellontheexperimentalsectionastheydidontheoperationalsections.
2Incomputer-adaptivetesting,thecomputerselectstherangeofquestionsthatisappropriatetoeachtesttaker'sabilitylevel.
Testtakersreceiveasetofquestionsthatmeettestdesignspecificationsandgenerallyareappropriateforeachtesttaker'sperformancelevel.
Questionsarechosenfromalargepoolofpossiblequestionscategorizedbycontentanddifficulty.
(Thecontentandtypesofquestionsaresimilartothatfoundincomparablepaper-basedtests.
)Thecomputer-adaptiveteststartswithquestionsofmoderatedifficulty.
Asthecandidateanswerseachquestion,thecomputerscoresthequestionandusesthatinformation,aswellasthecandidate'sresponsestopreviousquestions,todeterminewhichquestionispresentednext.
Aslongasthetesttakerrespondscorrectly,thecomputertypicallyselectsanextquestionofgreaterdifficulty.
Incontrast,ifthetesttakeranswersaquestionincorrectly,thecomputertypicallyselectsanextquestionoflesserdifficulty.
Subsequentquestionsarepresentedbasedinpartonthetesttaker'sperformanceonpreviousquestionsandinpartonthetestdesign.
Inotherwords,thecomputerisprogrammedtofulfillthetestdesignasitcontinuouslyadjuststofindquestionsofappropriatedifficultyfortesttakersofallperformancelevels.
AsTables1and2show,resultsfromthisstudyindicatethatextratimehadaminimaleffectonoverallscores,addingonlyabout7pointstoverbalscoresand7pointstoquantitativescoresonthe200-800scorescale.
And,aswasthecaseintheSATstudy,scoresunderthedifferentconditionswerecomparableacrossgenderandethnicgroups,althoughquantitativescoreswereslightlyhigherforlowerabilityexamineeswhohadmoretime.
Note,however,thattherearesomeimportantdifferencesbetweentheSATandGRE.
TheSATsubtractsafractionofapointforeveryquestionthatisansweredincorrectly,sothatitisbettertoleaveaquestionunansweredthantogiveanincorrectanswer.
TheGRE,ontheotherhand,hasapenaltyforleavingquestionsunansweredattheend.
QuestionsontheSATarearrangedforthemostparttobecomesuccessivelymoredifficult.
Lowerabilitytesttakersaremorelikelytoguessandgiveincorrectanswerstothelattersetofquestions,resultinginanegativeeffectontheirscores.
However,thisisnottrueforsectionswithreadingpassages,whichmakeupthemajorityoftheverbaltest.
Orderofthoseitemsisdependentuponwherethetopicstheindividualitemsrefertoappearinthepassage.
OntheGRECAT,lowerabilitytesttakerswouldreceivequestionsatorclosetotheirabilityleveltowardtheendofthetest,lesseningtheirneedtoguess.
ImpactofTimeLimitsonComputer-AdaptiveTestsAsmentionedearlier,theGRECATisnotintendedtobeaspeededtest,buthasafixednumberofquestionsandsectiontimelimits.
Sowhathappenswhentimelimitsareimposedonteststhatgivedifferentquestionstodifferentexaminees,particularlyifquestionsthataresupposedtobeequallydifficulttendtohavesubstantialdifferencesinthetimeittakestoanswerthemBridgemanandCline(2000)foundthatsomeofthequestionsintheGRE'sanalyticalandPage4of6quantitativesectionscouldbeansweredmuchmorequicklythanothers.
Theresearchersalsonotedthatwhilesomeofthisvariationinresponsetimewasrelatedtothedifficultyofthequestions—moredifficultquestionstendedtotakelongertoanswerthanlessdifficultones—therealsowassubstantialvariationinthetimerequiredtoanswerquestionsofroughlythesamedifficultylevelandmeetingthesamecontentspecifications.
Table1SampleSizes,Means,andStandardDeviationsforResearchGREQuantitativeScoresTimingconditionStatisticStandard(45min.
)Extended(68min.
)Differencen3,9043,749M6646717SD125121Table2SampleSizes,Means,andStandardDeviationsforResearchGREVerbalScoresTimingconditionStatisticStandard(30min.
)Extended(45min.
)Differencen4,1974,098M4544617SD122120Source:Bridgeman,Cline,&Hessinger,2003.
Giventhesefindings,itseemedconceivablethatexamineesreceivingtime-consumingtests(i.
e.
,thosewhogetadisproportionatenumberofitemsthattakealonger-than-averagetimetoanswer)couldbedisadvantagedand,asaresult,receivelowerscorescomparedtotesttakerswhogetalesstime-consumingtest.
Yet,uponfurtherinvestigation,BridgemanandCline(2000)couldfindnoevidenceofimpactontotaltestscores.
Inarelatedstudy,however,BridgemanandCline(2004)didfindevidencethattesttakersontheanalyticalsectionoftheGREwereindeedaffectedbythiscombinationofconditions,whichresultedintesttakershavingtoguessonthefinalquestionsinordertofinishthetestbeforerunningoutoftime.
Testtakersatthehigherabilitylevelstendedtoguessmorethanthoseatthelowerabilitylevelsbecausethequestionsadministeredtohigherabilityexamineesweretypicallymoretime-consuming.
Sinceguessingincreasesthechancesofansweringitemsincorrectly(whichwouldloweratesttaker'sscore),thesefindingsindicatethatexamineeswhoareadministeredtestswithadisproportionatenumberoftime-consumingitemsarelikelytogetlowerscoresthanthoseofcomparableabilitywhoreceivetestscontainingitemsthatcanbeansweredmorequickly.
It'sworthnotingthattheGRE'sanalyticalsectionhasbeenreplacedbytwoessaypromptsthatassessanalyticalwritingskills.
Althoughthepotentialproblemnotedabovecontributedtothisdecision,itwasnottheonlyconsideration(Bridgeman&Cline,2004).
ImplicationsThisresearchindicatesthatindividualstakingeithertheSATortheverbalandmathsectionsoftheGRECAThavesufficienttimetoanswerthequestions.
Thesetestsarenotspeededtoanysignificantdegree,andgivingtesttakersmoretimetocompletetheseitemsdoesnotresultinsignificantscoregains.
Thescoregainsthatwereachieved(lessthan10pointsfortheverbalsectionandlessthan30pointsforthemathsection,ona200-800scale)wereextremelyminorandwouldcertainlynotmakeorbreakastudent'seducationalaspirations.
Moreover,scoregainswerenotconsistentacrossabilitylevels:Fortheseassessments,high-scoringtesttakerstendedtobenefitmorethanlower-scoringstudents,withextratimecreatingnoincreaseinscoresforstudentswithSATscoresof400orlower.
Furthermore,racial/ethnicandgenderdifferenceswereneitherincreasednorreducedwithextratime,challengingargumentsthattheso-called"speeded"natureoftheSATdisadvantagesminorityandfemaletesttakers.
TheseresultsshouldhelptoreducethemotivationforstudentswhoarenotdisabledtoPage5of6manipulatethesysteminanattempttoobtainunwarrantedextended-timeaccommodations.
Atthesametime,testusersshouldnotbeoverlyconcernedthatsomestudentsmightbegaininganunfairadvantageinthismanner,sinceanysuchadvantagewouldlikelybequitesmall.
StudieswereconflictingregardingwhetherornottheAnalyticsectionoftheGRECATwasspeeded.
Althoughthemostrecentstudy(Bridgeman&Cline,2004)makeastrongargumentthatthetestwasindeedspeeded,itisnowamootpointsinceETSnolongeradministersthissection.
However,theinformationobtainedinthisstudyshouldproveusefultodevelopingfutureCATswithstricttimelimits.
ReferencesBecker,B.
J.
(1990).
ItemcharacteristicsandgenderdifferencesontheSAT-Mformathematicallyableyouths.
AmericanEducationalResearchJournal,27,65-87.
Bridgeman,B.
(2004,April).
Speedednessasathreattoconstructvalidity.
PaperpresentedattheannualmeetingoftheNationalCouncilonMeasurementinEducation,SanDiego,CA.
RetrievedOct.
19,2004,fromtheETSWebsite:http://www.
ets.
org/research/dload/NCME_2004-Bridgeman.
pdfBridgeman,B.
&Cline,F.
(2004).
Effectsofdifferentiallytime-consumingtestsoncomputer-adaptivetestscores.
JournalofEducationalMeasurement,41,137-148.
Bridgeman,B.
,&Cline,F.
(2000).
Variationsinmeanresponsetimesforquestionsonthecomputer-adaptiveGREGeneralTest:Implicationsforfairassessment(ETSRR-00-7).
RetrievedOct.
19,2004,fromtheETSWebsite:http://ftp.
ets.
org/pub/res/researcher/RR-00-07-Bridgeman.
pdfBridgeman,B.
,Cline,F.
,&Hessinger,J.
(2003).
EffectofextratimeonGREQuantitativeandVerbalscores(ETSRR-03-13).
RetrievedOct.
19,2004,fromtheETSWebsite:http://ftp.
ets.
org/pub/res/researcher/RR-03-13-Bridgeman.
pdfBridgeman,B.
,Trapani,C.
,&Curley,E.
(2003).
EffectoffewerquestionspersectiononSATIscores(CollegeBoardReportNo.
2003-2).
RetrievedOct.
19,2004,fromtheCollegeBoardWebsite:http://www.
collegeboard.
com/research/pdf/rdcbreport20032web_23502.
pdfBriel,J.
B.
,O'Neill,K.
A.
,&Scheuneman,J.
D.
(1993).
GREtechnicalmanual.
Princeton,NJ:ETS.
Camara,W.
,Copeland,T.
,&Rothschild,B.
(1998).
EffectsofextendedtimeontheSAT:Reasoningtestscoregrowthforstudentswithlearningdisabilities(CollegeBoardReportNo.
98-7).
RetrievedOct.
19,2004,fromtheCollegeBoardWebsite:http://www.
collegeboard.
com/research/pdf/rr9807_3912.
pdfDonlon,T.
F.
(Ed.
).
(1984).
TheCollegeBoardtechnicalhandbookfortheScholasticAptitudeTestandAchievementTests.
NewYork:CollegeEntranceExaminationBoard.
Linn,M.
C.
(1992).
Genderdifferencesineducationalachievement.
InSexequityeducationalopportunity,achievement,andtesting:Proceedingsofthe1991ETSInvitationalConference(pp.
11–50).
Princeton,NJ:ETS.
Mandinach,E.
,Cahalan,C.
,&Camara,W.
(2002).
Theimpactofflaggingontheadmissionprocess:Policies,practices,andimplications(CollegeBoardReportNo.
2002-2).
RetrievedOct.
19,2004,fromtheCollegeBoardWebsite:http://www.
collegeboard.
com/research/pdf/02595020604txtcvr_11433.
pdfR&DConnectionsispublishedbyETSResearch&DevelopmentEducationalTestingServiceRosedaleRoad,19-TPrinceton,NJ08541-0001SendcommentsaboutthispublicationtotheaboveaddressorviatheWebat:http://www.
ets.
org/research/contact.
htmlCopyright2004byEducationalTestingService.
Allrightsreserved.
EducationalTestingServiceisanAffirmativeAction/EqualOpportunityEmployer.
EducationalTestingService,ETS,andtheETSlogoGraduateRecordExaminations,andGREareregisteredtrademarksofEducationalTestingService.
CollegeBoardandSATareregisteredtrademarksoftheCollegeEntranceExaminationBoard.
SATReasoningTestisatrademarkoftheCollegeEntranceExaminationBoard.
Listening.
Learning.
Leading.
Page6of6
特网云官網特网云服务器在硬件级别上实现云主机之间的完全隔离;采用高端服务器进行部署,同时采用集中的管理与监控,确保业务稳定可靠,搭建纯SSD架构的高性能企业级云服务器,同时采用Intel Haswell CPU、高频DDR4内存、高速Sas3 SSD闪存作为底层硬件配置,分钟级响应速度,特网云采用自带硬防节点,部分节点享免费20G防御,可实现300G防御峰值,有效防御DDoS、CC等恶意攻击,保障...
CloudCone在月初发了个邮件,表示上新了一个系列VPS主机,采用SSD缓存磁盘,支持下单购买额外的CPU、内存和硬盘资源,最低年付17.99美元起。CloudCone成立于2017年,提供VPS和独立服务器租用,深耕洛杉矶MC机房,最初提供按小时计费随时退回,给自己弄回一大堆中国不能访问的IP,现在已经取消了随时删除了,不过他的VPS主机价格不贵,支持购买额外IP,还支持购买高防IP。下面列...
欧路云怎么样?欧路云主要运行弹性云服务器,可自由定制配置,可选加拿大的480G超高防系列,也可以选择美国(200G高防)系列,也有速度直逼内地的香港CN2系列。所有配置都可以在下单的时候自行根据项目 需求来定制自由升级降级 (降级按天数配置费用 退款回预存款)。2021年7月14日美国 CERA 弹性云服务器 上新 联通CUVIP 线路!8折特惠中!点击进入:欧路云官方网站地址付款方式:PayPa...
59ddd.com为你推荐
酒店回应名媛拼单名媛一天到晚都做什么?中老铁路老挝磨丁经济特区的前景如何?www.bbb551.combbb是什么意思66smsm.com【回家的欲望(回家的诱惑)大结局】 回家的诱惑全集66 67 68 69 70集QOVD快播观看地址??www.15job.com广州天河区的南方人才市场www.toutoulu.comSEO行业外链怎么做?45gtv.comLETSCOM是什么牌子?www.k8k8.com谁能给我几个街污网站我去自己学henhenlu.com谁有大片地址呀 麻烦告诉我 谢谢啦 O会给你打满分的222cc.com有什么电影网站啊
虚拟主机99idc 重庆域名注册 宿迁服务器租用 老左 winscp 荷兰服务器 burstnet rak机房 godaddy支付宝 seovip debian7 申请空间 新天域互联 789电视网 网通服务器托管 四核服务器 带宽租赁 帽子云排名 石家庄服务器托管 双线空间 更多