editions59ddd.com

59ddd.com  时间:2021-03-20  阅读:()
R&DConnectionsNovember2004TestingandTimeLimitsBrentBridgeman,AmandaMcBride,&WilliamMonaghanTestingandtimelimits.
It'sanalmostinevitableunion—andforgoodreason,manywouldargue.
Imposingtimelimitsontestscanservearangeofimportantfunctions.
Timelimitsareessential,forexample,ifspeedofperformanceisanintegralcomponentofwhatisbeingmeasured,aswouldbethecasewhentestingsuchskillsashowquicklysomeonecantype.
Limitingtestingtimealsohelpscontainexpensesassociatedwithtestadministrations,suchaspayinghourlyfeesforproctorsinapaper-basedadministrationorforseattimeatcomputertestingcenters.
Butlimitingtestingtimetoodrasticallycanthreatenatest'svalidity,ortheabilityofthetesttoaccuratelyreflectwhatthetestwasdesignedtomeasure.
Thisisparticularlytrueifthetestisnotintendedtomeasurehowquicklythetesttakercananswerquestionsorifthetestingtimeissolimitedthatalargenumberofexamineestakingthetestcannotcompleteit;thatis,ifthetestis"speeded.
"Speedednessintestingreferstotheeffectthattimelimitshaveontesttakers'scores.
Whenatest'stimelimitsareconstrainedtothepointthatmosttesttakersdonothaveenoughtimetoconsiderandanswereachquestion,thetestissaidtobe"speeded.
"Atestisspeededtotheextentthatthosetakingitscorelowerthantheywouldhaveiftheyhadbeengivenanunlimitedamountoftimetocompleteit.
FortestssuchastheGREandCollegeBoard'sSAT,whichareintendedtomeasureskillsrelatedtoacademicabilityratherthantherateatwhichexamineescanwork,thespeedatwhichtesttakersanswerthequestionsshouldplayaminorrole,atmost,indeterminingtestscores(Briel,O'Neill,&Scheuneman,1993;Donlon,1984).
Consequently,timelimitsforsuchtestsshouldgivemosttesttakersenoughtimetofinishthetest,andamodesttimeextensionshouldhavearelativelysmalleffectonoveralltestscores(Bridgeman,Cline,&Hessinger,2003).
Whileit'spossiblethattimelimitscanaffectthescoresofalltesttakers,somehavesuggestedthatsuchlimitsmaydifferentiallyaffectfemaleandminoritytesttakers.
Someclaimthatthe"fast-paced,orspeedednature"oftheSATputsfemaletesttakersatadisadvantageoncertaintestsectionsbecausetheyapproachproblem-solvingdifferentlythantheirmalecounterparts—femaletesttakers,theysay,aremorelikelytoworkproblemsoutcompletely,toconsidermorethanonepossibleanswer,andtochecktheirwork(Becker,1990;Linn,1992).
Othershavenotedwhatseemstobeacommonbeliefamongtesttakersandtheirfamilies(andevenamongsomeschoolcounselors)thatgivingexamineesmoretimetocompleteatestcouldsubstantiallyimprovetheirscores.
Thishasraisedconcernsoverthepossibilitythatnondisabledstudentsmayattempttoobtainextended-timeaccommodations(whichETSprovidestoexamineeswithdocumenteddisabilitiesthatrequireadditionaltestingtime,suchaslearningdisabilities,Attention-Deficit/HyperactivityDisorder,orsightproblems),andthusgainaperceivedadvantageonstandardizedtests(Bridgeman,Trapani,&Curley,2003;Mandinach,Cahalan,&Camara,2002).
Butifevidencesuggeststhatextratimedoesnotimprovetesttakerperformance,studentswouldhavelittleornomotivationtomanipulatethesystemtoreceiveextratest-takingtimethatthey'renotentitledto.
AndtherewouldbelessListening.
Learning.
Leading.
reasontoflag1thescoresofstudentswhoweregrantedextendedtime,apracticethathasengenderedfiercedebatesinceitsimplementationdecadesago.
EffectofExtraTimeonSATTestScoresWithallthisinmind,theobviousquestionsseemtobe,whathappenswhentesttakersaregivenmoretimetocompleteastandardizedtestDotesttakers'scoresimprovewhentheyaregivenmoretimeAndifso,byhowmuchTobegintoanswerthesequestions,Bridgeman,Trapani,andCurley(2003)placedSATReasoningTestsectionswithafewernumberofquestionsintothestandard30-minutevariablesectionoftwonationaltestadministrations.
Thissectiondoesnotcounttowardthefinalscoresoftesttakers,butisusedtotryoutnewquestionsandtoensurethatscoresonneweditionsofthetestarecomparabletothoseonearliereditions.
Theresearcherscreatedthereducednumbersectionsbydeletingquestionsfromaverbalsectionthatcontained35questions,toproducetwosetsofforms,onewith27questionsandanotherwith23.
Thescoresonthe23questionscouldthenbecomparedtothescoresonthesame1"Flagging"referstothepracticebywhichadministratorsofstandardizedtestsplaceasterisksorothersimilarnotationsonthescorereportsofpeoplewithdisabilitieswhotakeexamsundercertainnonstandardconditions.
Theseconditionsusuallyinvolveanaccommodationonoramodificationtothetestandmayincludeprovidingpeopletoreadthetestinstructionsandquestionsaloud,large-printandBrailleformsofthetest,individualizedadministration,orextendedtime.
Accommodationsareintendedtoeliminateirrelevantsourcesofdifficultythatarerelatedtothedisabilitybutnottotheconstructbeingassessed.
It'sworthnotingthatthenumberofstudentsrequestingextratimehasgrownbyabout26percentoverthepastfiveyears(Camara,Copeland,&Rothschild,1998).
It'salsoimportanttonotethat,asofOct.
1,2001,ETSnolongerflagsscoresofteststhatwereadministeredunderanaccommodationofextendedtime.
23questionsinthesectionscontainingthe27or35questions.
Thiswasdoneforboththemathandtheverbalsectionsofthetest.
AscanbeseeninFigures1,2,3,and4,theresearchersfoundthatallowingmoretimeperquestion(theequivalentoftime-and-a-half)hadminimalimpactonverbalscores,producinggainsoflessthan10pointsonthe200-800SATscale.
Infact,inthefirststudy,scoresforthelowerabilitygroup(thosewhoscoredbelow400)actuallydecreasedwithextratime.
TheseresultssuggestthattheSATverbalsectionisonlyslightlyspeeded.
Themathsectionappearstobemorespeededthantheverbalsection,butnothighlyspeeded:Theequivalentoftime-and-a-halfraisedscoresabout20points,althoughtheincreasewassomewhatgreater(17-26points)forhigherabilitystudents(abilitylevel>600).
Forbothsections,increasingthetimetendedtobenefithigh-scoringstudentsmorethanlower-scoringstudents,withextratimecreatingnoincreaseinscoresforstudentswithSATscoresof400orlower(abilitylevel600>60030-Items30-Items25-Items25-Items35040045050055060065070060025-Items20-Items17-ItemsAbilityGroupsStudy135040045050055060065070060025-Items20-Items17-ItemsFigure3.
Meanscoreson17M1itemswithstandardtiming(embeddedina25-itemsection),andwithtwolessspeededconditions(embeddedina20-itemsectionandasacomplete17-itemsection).
AbilityGroupsStudy235040045050055060065070060025-Items22-Items35040045050055060065070060025-Items22-ItemsFigure4.
Meanscoreson22M2itemswithstandardtiming(embeddedina25-itemsection),andwithalessspeededcondition(acomplete22-itemsection).
Source:Bridgeman,Trapani,&Curley,2003.
AbilityGroupsStudy1AbilityGroupsStudy235040045050055060065070070060035-Items27-Items23-ItemsAbilityGroupsStudy1Figure1.
Meanscoreson23V1itemswithstandardtiming(embeddedina35-itemsection),andwithtwolessspeededconditions(embeddedina27-itemsectionandasacomplete23-itemsection).
35040045050055060065035-Items27-Items23-Items410-600>60060030-Items25-ItemsAbilityGroupsStudy1650600550500450400350AbilityGroupsStudy2Page3of6600Figure2.
Meanscoreson25M1itemswithstandardtiming(embeddedina30-itemsection),andwithalessspeededconditions(acomplete25-itemsection).
25-Items30-ItemsEffectofExtraTimeonQuantitativeandVerbalGREScoresAswiththeSAT,timelimitsfortheGREGeneralTestareintendedtobesetsothatmosttesttakerscancompletethetest.
Amodesttimeextension,then,shouldhavearelativelysmalleffectontestscores.
TheresultsfromtheSATstudy,however,cannotbeappliedtothecurrentcomputer-adaptiveGREGeneralTestbecauseofthecontentandtimingdifferencesofthetwotests,andbecauseofthedifferencesbetweencomputer-adaptivetesting(CAT)andpaper-basedadministration.
InaCAT,unlikepaper-basedtests,differentexamineesreceivedifferentsetsofquestions.
2UnlikemanyCATs,theGRECAThasafixednumberofquestionsandstricttimelimitsforeachsection,althoughitisnotintendedtobeaspeededtest.
ToinvestigatespeedednessandtheGRECAT,Bridgeman,Cline,andHessinger(2003)performedastudyinwhicharesearchsectionwasaddedtotheendofregularadministrationsoftheCATGRE.
VolunteerstookeitheraverbaloraquantitativeGREsectionwitheitherstandardtimingorone-and-a-halftimesthestandardtimelimit.
Toencouragemotivatedperformance,participantswereeligibleforacashpaymentiftheydidaswellontheexperimentalsectionastheydidontheoperationalsections.
2Incomputer-adaptivetesting,thecomputerselectstherangeofquestionsthatisappropriatetoeachtesttaker'sabilitylevel.
Testtakersreceiveasetofquestionsthatmeettestdesignspecificationsandgenerallyareappropriateforeachtesttaker'sperformancelevel.
Questionsarechosenfromalargepoolofpossiblequestionscategorizedbycontentanddifficulty.
(Thecontentandtypesofquestionsaresimilartothatfoundincomparablepaper-basedtests.
)Thecomputer-adaptiveteststartswithquestionsofmoderatedifficulty.
Asthecandidateanswerseachquestion,thecomputerscoresthequestionandusesthatinformation,aswellasthecandidate'sresponsestopreviousquestions,todeterminewhichquestionispresentednext.
Aslongasthetesttakerrespondscorrectly,thecomputertypicallyselectsanextquestionofgreaterdifficulty.
Incontrast,ifthetesttakeranswersaquestionincorrectly,thecomputertypicallyselectsanextquestionoflesserdifficulty.
Subsequentquestionsarepresentedbasedinpartonthetesttaker'sperformanceonpreviousquestionsandinpartonthetestdesign.
Inotherwords,thecomputerisprogrammedtofulfillthetestdesignasitcontinuouslyadjuststofindquestionsofappropriatedifficultyfortesttakersofallperformancelevels.
AsTables1and2show,resultsfromthisstudyindicatethatextratimehadaminimaleffectonoverallscores,addingonlyabout7pointstoverbalscoresand7pointstoquantitativescoresonthe200-800scorescale.
And,aswasthecaseintheSATstudy,scoresunderthedifferentconditionswerecomparableacrossgenderandethnicgroups,althoughquantitativescoreswereslightlyhigherforlowerabilityexamineeswhohadmoretime.
Note,however,thattherearesomeimportantdifferencesbetweentheSATandGRE.
TheSATsubtractsafractionofapointforeveryquestionthatisansweredincorrectly,sothatitisbettertoleaveaquestionunansweredthantogiveanincorrectanswer.
TheGRE,ontheotherhand,hasapenaltyforleavingquestionsunansweredattheend.
QuestionsontheSATarearrangedforthemostparttobecomesuccessivelymoredifficult.
Lowerabilitytesttakersaremorelikelytoguessandgiveincorrectanswerstothelattersetofquestions,resultinginanegativeeffectontheirscores.
However,thisisnottrueforsectionswithreadingpassages,whichmakeupthemajorityoftheverbaltest.
Orderofthoseitemsisdependentuponwherethetopicstheindividualitemsrefertoappearinthepassage.
OntheGRECAT,lowerabilitytesttakerswouldreceivequestionsatorclosetotheirabilityleveltowardtheendofthetest,lesseningtheirneedtoguess.
ImpactofTimeLimitsonComputer-AdaptiveTestsAsmentionedearlier,theGRECATisnotintendedtobeaspeededtest,buthasafixednumberofquestionsandsectiontimelimits.
Sowhathappenswhentimelimitsareimposedonteststhatgivedifferentquestionstodifferentexaminees,particularlyifquestionsthataresupposedtobeequallydifficulttendtohavesubstantialdifferencesinthetimeittakestoanswerthemBridgemanandCline(2000)foundthatsomeofthequestionsintheGRE'sanalyticalandPage4of6quantitativesectionscouldbeansweredmuchmorequicklythanothers.
Theresearchersalsonotedthatwhilesomeofthisvariationinresponsetimewasrelatedtothedifficultyofthequestions—moredifficultquestionstendedtotakelongertoanswerthanlessdifficultones—therealsowassubstantialvariationinthetimerequiredtoanswerquestionsofroughlythesamedifficultylevelandmeetingthesamecontentspecifications.
Table1SampleSizes,Means,andStandardDeviationsforResearchGREQuantitativeScoresTimingconditionStatisticStandard(45min.
)Extended(68min.
)Differencen3,9043,749M6646717SD125121Table2SampleSizes,Means,andStandardDeviationsforResearchGREVerbalScoresTimingconditionStatisticStandard(30min.
)Extended(45min.
)Differencen4,1974,098M4544617SD122120Source:Bridgeman,Cline,&Hessinger,2003.
Giventhesefindings,itseemedconceivablethatexamineesreceivingtime-consumingtests(i.
e.
,thosewhogetadisproportionatenumberofitemsthattakealonger-than-averagetimetoanswer)couldbedisadvantagedand,asaresult,receivelowerscorescomparedtotesttakerswhogetalesstime-consumingtest.
Yet,uponfurtherinvestigation,BridgemanandCline(2000)couldfindnoevidenceofimpactontotaltestscores.
Inarelatedstudy,however,BridgemanandCline(2004)didfindevidencethattesttakersontheanalyticalsectionoftheGREwereindeedaffectedbythiscombinationofconditions,whichresultedintesttakershavingtoguessonthefinalquestionsinordertofinishthetestbeforerunningoutoftime.
Testtakersatthehigherabilitylevelstendedtoguessmorethanthoseatthelowerabilitylevelsbecausethequestionsadministeredtohigherabilityexamineesweretypicallymoretime-consuming.
Sinceguessingincreasesthechancesofansweringitemsincorrectly(whichwouldloweratesttaker'sscore),thesefindingsindicatethatexamineeswhoareadministeredtestswithadisproportionatenumberoftime-consumingitemsarelikelytogetlowerscoresthanthoseofcomparableabilitywhoreceivetestscontainingitemsthatcanbeansweredmorequickly.
It'sworthnotingthattheGRE'sanalyticalsectionhasbeenreplacedbytwoessaypromptsthatassessanalyticalwritingskills.
Althoughthepotentialproblemnotedabovecontributedtothisdecision,itwasnottheonlyconsideration(Bridgeman&Cline,2004).
ImplicationsThisresearchindicatesthatindividualstakingeithertheSATortheverbalandmathsectionsoftheGRECAThavesufficienttimetoanswerthequestions.
Thesetestsarenotspeededtoanysignificantdegree,andgivingtesttakersmoretimetocompletetheseitemsdoesnotresultinsignificantscoregains.
Thescoregainsthatwereachieved(lessthan10pointsfortheverbalsectionandlessthan30pointsforthemathsection,ona200-800scale)wereextremelyminorandwouldcertainlynotmakeorbreakastudent'seducationalaspirations.
Moreover,scoregainswerenotconsistentacrossabilitylevels:Fortheseassessments,high-scoringtesttakerstendedtobenefitmorethanlower-scoringstudents,withextratimecreatingnoincreaseinscoresforstudentswithSATscoresof400orlower.
Furthermore,racial/ethnicandgenderdifferenceswereneitherincreasednorreducedwithextratime,challengingargumentsthattheso-called"speeded"natureoftheSATdisadvantagesminorityandfemaletesttakers.
TheseresultsshouldhelptoreducethemotivationforstudentswhoarenotdisabledtoPage5of6manipulatethesysteminanattempttoobtainunwarrantedextended-timeaccommodations.
Atthesametime,testusersshouldnotbeoverlyconcernedthatsomestudentsmightbegaininganunfairadvantageinthismanner,sinceanysuchadvantagewouldlikelybequitesmall.
StudieswereconflictingregardingwhetherornottheAnalyticsectionoftheGRECATwasspeeded.
Althoughthemostrecentstudy(Bridgeman&Cline,2004)makeastrongargumentthatthetestwasindeedspeeded,itisnowamootpointsinceETSnolongeradministersthissection.
However,theinformationobtainedinthisstudyshouldproveusefultodevelopingfutureCATswithstricttimelimits.
ReferencesBecker,B.
J.
(1990).
ItemcharacteristicsandgenderdifferencesontheSAT-Mformathematicallyableyouths.
AmericanEducationalResearchJournal,27,65-87.
Bridgeman,B.
(2004,April).
Speedednessasathreattoconstructvalidity.
PaperpresentedattheannualmeetingoftheNationalCouncilonMeasurementinEducation,SanDiego,CA.
RetrievedOct.
19,2004,fromtheETSWebsite:http://www.
ets.
org/research/dload/NCME_2004-Bridgeman.
pdfBridgeman,B.
&Cline,F.
(2004).
Effectsofdifferentiallytime-consumingtestsoncomputer-adaptivetestscores.
JournalofEducationalMeasurement,41,137-148.
Bridgeman,B.
,&Cline,F.
(2000).
Variationsinmeanresponsetimesforquestionsonthecomputer-adaptiveGREGeneralTest:Implicationsforfairassessment(ETSRR-00-7).
RetrievedOct.
19,2004,fromtheETSWebsite:http://ftp.
ets.
org/pub/res/researcher/RR-00-07-Bridgeman.
pdfBridgeman,B.
,Cline,F.
,&Hessinger,J.
(2003).
EffectofextratimeonGREQuantitativeandVerbalscores(ETSRR-03-13).
RetrievedOct.
19,2004,fromtheETSWebsite:http://ftp.
ets.
org/pub/res/researcher/RR-03-13-Bridgeman.
pdfBridgeman,B.
,Trapani,C.
,&Curley,E.
(2003).
EffectoffewerquestionspersectiononSATIscores(CollegeBoardReportNo.
2003-2).
RetrievedOct.
19,2004,fromtheCollegeBoardWebsite:http://www.
collegeboard.
com/research/pdf/rdcbreport20032web_23502.
pdfBriel,J.
B.
,O'Neill,K.
A.
,&Scheuneman,J.
D.
(1993).
GREtechnicalmanual.
Princeton,NJ:ETS.
Camara,W.
,Copeland,T.
,&Rothschild,B.
(1998).
EffectsofextendedtimeontheSAT:Reasoningtestscoregrowthforstudentswithlearningdisabilities(CollegeBoardReportNo.
98-7).
RetrievedOct.
19,2004,fromtheCollegeBoardWebsite:http://www.
collegeboard.
com/research/pdf/rr9807_3912.
pdfDonlon,T.
F.
(Ed.
).
(1984).
TheCollegeBoardtechnicalhandbookfortheScholasticAptitudeTestandAchievementTests.
NewYork:CollegeEntranceExaminationBoard.
Linn,M.
C.
(1992).
Genderdifferencesineducationalachievement.
InSexequityeducationalopportunity,achievement,andtesting:Proceedingsofthe1991ETSInvitationalConference(pp.
11–50).
Princeton,NJ:ETS.
Mandinach,E.
,Cahalan,C.
,&Camara,W.
(2002).
Theimpactofflaggingontheadmissionprocess:Policies,practices,andimplications(CollegeBoardReportNo.
2002-2).
RetrievedOct.
19,2004,fromtheCollegeBoardWebsite:http://www.
collegeboard.
com/research/pdf/02595020604txtcvr_11433.
pdfR&DConnectionsispublishedbyETSResearch&DevelopmentEducationalTestingServiceRosedaleRoad,19-TPrinceton,NJ08541-0001SendcommentsaboutthispublicationtotheaboveaddressorviatheWebat:http://www.
ets.
org/research/contact.
htmlCopyright2004byEducationalTestingService.
Allrightsreserved.
EducationalTestingServiceisanAffirmativeAction/EqualOpportunityEmployer.
EducationalTestingService,ETS,andtheETSlogoGraduateRecordExaminations,andGREareregisteredtrademarksofEducationalTestingService.
CollegeBoardandSATareregisteredtrademarksoftheCollegeEntranceExaminationBoard.
SATReasoningTestisatrademarkoftheCollegeEntranceExaminationBoard.
Listening.
Learning.
Leading.
Page6of6

RAKsmart含站群服务器/10G带宽不限流量首月半价

RAKsmart 商家估摸着前段时间服务器囤货较多,这两个月的促销活动好像有点针对独立服务器。前面才整理到七月份的服务器活动在有一些配置上比上个月折扣力度是大很多,而且今天看到再来部分的服务器首月半价,一般这样的促销有可能是商家库存充裕。比如近期有一些服务商挖矿服务器销售不好,也都会采用这些策略,就好比电脑硬件最近也有下降。不管如何,我们选择服务器或者VPS主机要本着符合自己需求,如果业务不需要,...

wordpress外贸集团企业主题 wordpress高级推广外贸主题

wordpress外贸集团企业主题,wordpress通用跨屏外贸企业响应式布局设计,内置更完善的外贸企业网站优化推广功能,完善的企业产品营销展示 + 高效后台自定义设置。wordpress高级推广外贸主题,采用标准的HTML5+CSS3语言开发,兼容当下的各种主流浏览器,根据用户行为以及设备环境(系统平台、屏幕尺寸、屏幕定向等)进行自适应显示; 完美实现一套主题程序支持全部终端设备,保证网站在各...

Hostiger 16G大内存特价VPS:伊斯坦布尔机房,1核50G SSD硬盘200Mbps带宽不限流量$59/年

国外主机测评昨天接到Hostigger(现Hostiger)商家邮件推送,称其又推出了一款特价大内存VPS,机房位于土耳其的亚欧交界城市伊斯坦布尔,核50G SSD硬盘200Mbps带宽不限月流量只要$59/年。 最近一次分享的促销信息还是5月底,当时商家推出的是同机房同配置的大内存VPS,价格是$59.99/年,不过内存只有10G,虽然同样是大内存,但想必这次商家给出16G,价格却是$59/年,...

59ddd.com为你推荐
网络访问域名访问提示是什么意思国家网络安全部国家网络安全西部妈妈网加入新疆妈妈网如何通过验证?access数据库access数据库主要学什么22zizi.com河南福利彩票22选52010175开奖结果www.kkk.com谁有免费的电影网站,越多越好?罗伦佐娜罗拉芳娜 (西班牙小姐)谁可以简单的介绍以下百花百游百花净斑方多少钱一盒www.55125.cn如何登录www.jbjy.cnwww.7788k.comwww.6601txq.com.有没有这个网站
什么是虚拟主机 域名批量查询 韩国vps俄罗斯美女 德国vps krypt buyvm 狗爹 美元争夺战 iisphpmysql 网站实时监控 dux 新世界服务器 上海电信测速 谷歌台湾 万网空间 免费个人网页 稳定空间 带宽测试 上海联通 百度新闻源申请 更多