On the Evaluation Methods of Search Engines
A long time ago, search engines were not the crowded field they are today, with a hundred flowers blooming at once. People's expectations were low: as long as an engine could find relevant websites on the Internet, finding a few more sites or a few fewer was good enough. So at that time, people evaluated search engines by picking a few keywords and comparing search speed, the number of results, and the number of unrelated websites. In short, it was a catch-all test. Since search engine technology did not differ much back then, this evaluation method was workable.
Since then, distinctive search technologies have emerged one after another, and the field is now clearly in its Warring States period. Yet people's evaluation methods have not changed much: the common approach is still to take several keywords and compare search speed, the number of results, and the accuracy of each engine.
To take a nearby example: after its upgrade in the first quarter of 2001, Ask Jeeves lets you dial its phone number from any phone, or click a link on its page to start an online voice conversation using your computer's microphone and speakers. You simply speak your request. It converts your voice into text, analyzes the request, looks for an answer among its 7 million standard answers, its repository of 2 million multimedia items, and the Internet, and then converts what it finds back into speech to answer you.
Imagine you ask, "The recent American election is still undecided; what do Americans think?" After a moment, the computer or telephone answers: "According to the latest survey, if Bush is finally elected, 80% of Americans will accept him as the legitimate president; if Gore is finally elected, 79% of Americans will accept him as the legitimate president." If you ask, "Who scored in the last World Cup final?", it answers with the player's name and offers audio and video clips of the goal for you to enjoy (the clips, of course, assume you are using a computer rather than a phone). Ask Jeeves believes its speech handling and search speed have reached a commercially viable level, but it will still have many immature aspects. If you take a few keywords and test its search speed, precision, and recall alongside many ordinary search engines, where will it rank? If it ranks last, is it a lousy search engine?
Evaluating Internet search engines alone is already very difficult. But many evaluations are written for ordinary Internet users, so they are bound to include portals such as Yahoo and Sina, for which Internet search is only one part of what they offer. What should be done about their other kinds of search? If you leave them out, you ignore something many users rely on; if you count them, the comparison becomes a mess, and where would you even start?
Here, let us analyze the flaws of several important evaluation criteria:
One: recall
Since it is a search engine, the first requirement is naturally that it can find things; if it fails at that, there seems to be no need for it. The number of indexed pages that each search engine announces cannot be fully trusted, while the number of results returned for a keyword is plain to see, so evaluations generally include this item.
But this criterion still has many problems. Even for the most mediocre search engine, I can find a number of keywords that "prove" its results are the most complete. Index sizes differ, but so do the robot and spider programs, the scope of the index, and the indexing standards, so for some queries even the biggest search engine finds far less than a small one.
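The point above can be sketched with two toy indexes. Everything here is hypothetical (the engines, keywords, and page ids are made up for illustration); it only shows that a larger index does not guarantee more results for every keyword.

```python
# Toy inverted indexes: keyword -> set of page ids (all values are made up).
big_engine = {
    "music": {1, 2, 3, 4, 5, 6},
    "violin": {7, 8},
    "paganini": {9},
}
small_engine = {
    "music": {1, 2},
    "paganini": {9, 10, 11},
}


def index_size(index):
    """Number of distinct pages an engine has indexed."""
    return len(set().union(*index.values()))


def result_count(index, keyword):
    """Number of results an engine returns for one keyword."""
    return len(index.get(keyword, set()))


# Decide the "winner" per keyword purely by result count,
# as a keyword-based recall comparison would.
winners = {}
for keyword in ("music", "paganini"):
    big = result_count(big_engine, keyword)
    small = result_count(small_engine, keyword)
    winners[keyword] = "big" if big > small else "small"
```

With these made-up numbers the big engine indexes more pages overall, yet the small engine wins on "paganini", so the outcome depends on which keywords the evaluator happens to pick.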
Some search engines support searching for function words such as "about", "of", and "ah", and others do not. Which evaluation has ever mentioned that?
Besides the content of the query being hard to choose, its length is hard to settle too.
Some search engines do not support searching for a single Chinese character; how do you count that? Evaluations generally use only single-keyword searches, so what about multi-keyword searches? What about searches for long sentences? Some search engines even accept an arbitrary article or fragment as the query, so their results cannot be compared on the same footing as keyword search, and the other engines have no comparable function at all. Then there are semantic search engines like Excite, as well as engines that support fuzzy search: for keywords where other engines find very few results or none, they can return a whole bunch. How do you compare them?
Finally, a search engine can optimize its results for specific keywords, so who will guarantee the fairness of the evaluation? If one of the evaluated engines learns the keywords in advance, it can easily optimize for them and take first place.
Two: search speed
After recall comes search speed. If a search engine indexes more pages but each search takes five or six seconds or longer, it should simply be eliminated; there is no point in comparing it any further.
The first problem with speed again lies in the keywords: an engine that is fast on single-keyword searches is not necessarily fast on multi-keyword searches.
Then there is the problem of traffic: comparing a search engine with more than a hundred million daily visits against one with tens of thousands of visits per day is unfair. The same goes for index size. If one search engine indexes 1 billion pages and another indexes ten million, and each searches its own database for the same keywords, how can a comparison of their search speed convince anyone?
There is also the optimization problem. Some search engines cache search results to speed up repeated queries: even if the first search for a word takes 10 seconds, the second may take 2 seconds, and by the third or fourth time, or whenever you come to test it, it always takes 0.0001 seconds. So if you test with a common word, the speed looks amazing; if you pick an obscure word, you may wait a long time for anything to come back. Which keywords should you choose? What share of them should be common words? It is a thoroughly muddled calculation.
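The caching effect described above can be sketched as follows. This is a minimal illustration, not any real engine's design: `slow_index_lookup` and `search` are hypothetical names, and the cache is Python's standard `functools.lru_cache`. Instead of timing anything, it counts how often the slow lookup actually runs.

```python
from functools import lru_cache

backend_calls = 0  # how many times the (hypothetical) slow lookup really ran


def slow_index_lookup(keyword: str) -> tuple[str, ...]:
    """Stand-in for an expensive index scan; the result format is made up."""
    global backend_calls
    backend_calls += 1
    return (f"result for {keyword}",)


@lru_cache(maxsize=1024)
def search(keyword: str) -> tuple[str, ...]:
    """Cached front end: a repeated keyword never reaches the slow lookup."""
    return slow_index_lookup(keyword)


# A benchmark that repeats one common keyword mostly measures the cache.
for _ in range(4):
    search("news")
common_calls = backend_calls  # only the first of the four searches hit the backend

# A benchmark of distinct obscure keywords pays the full cost every time.
for word in ("paganini", "caprice", "guarneri", "cantabile"):
    search(word)
rare_calls = backend_calls - common_calls
```

Four searches for the same common word reach the backend once, while four obscure words reach it four times, which is why the choice of test keywords can dominate a measured "search speed".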
Search engines do not run on a local machine in the lab; they serve ordinary users, so the search time should also include the time to transmit the search interface and the search results. Suppose one search engine takes 0.0001 seconds to search but 3 seconds to deliver the page, while another takes 0.5 seconds to search but 1 second to deliver the page. Which search engine would you say is faster? In real use, would you rather see the results after 3.0001 seconds or after 1.5 seconds?
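The arithmetic in this example can be written out directly. The function name `perceived_latency` is a hypothetical label for "search time plus transfer time"; the numbers are the ones given in the paragraph above.

```python
def perceived_latency(search_seconds: float, transfer_seconds: float) -> float:
    """Time the user actually waits: engine search time plus page delivery time."""
    return search_seconds + transfer_seconds


# Engine A: very fast search, slow page delivery.
engine_a = perceived_latency(0.0001, 3.0)
# Engine B: slower search, faster page delivery.
engine_b = perceived_latency(0.5, 1.0)

# From the user's point of view, the smaller total wins.
faster = "A" if engine_a < engine_b else "B"
```

Judged on search time alone, engine A is thousands of times faster; judged on what the user waits for, engine B wins.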
Three: precision
This is very important. Suppose a search is fast and returns a great deal, but the result you want is buried who knows how many pages deep. What use are such results? This kind of search engine is only useful when searching for rare things, but to search for rare things you should go to a meta search engine, so why use it? The criteria for precision are difficult to fix, because they depend on what you are looking for: looking for one specific website is different from looking for websites of a certain kind. The key to measured precision is what is searched for and which keywords are chosen. The evaluator can decide these arbitrarily, and that in turn affects the reliability of the evaluation results.
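A small sketch of why measured precision depends on the evaluator. It uses the usual definition of precision over the top k results (relevant results divided by results examined); the site names and the two judges' relevance sets are made up for illustration.

```python
def precision_at_k(ranked_results, relevant, k):
    """Fraction of the top-k results that the judge considers relevant."""
    top_k = ranked_results[:k]
    if not top_k:
        return 0.0
    hits = sum(1 for url in top_k if url in relevant)
    return hits / len(top_k)


# One and the same ranked result list from a hypothetical engine.
ranked = ["site-a", "site-b", "site-c", "site-d", "site-e"]

# Judge 1 is looking for one specific website.
judge_specific = {"site-d"}
# Judge 2 accepts any website of the same kind.
judge_similar = {"site-a", "site-b", "site-d", "site-e"}

p_specific = precision_at_k(ranked, judge_specific, 5)  # 1 of 5 relevant
p_similar = precision_at_k(ranked, judge_similar, 5)    # 4 of 5 relevant
```

The result list is identical in both cases; only the judge's notion of relevance changes, and the score moves from 0.2 to 0.8.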
Four: dead links
Most search engines return some results whose links lead nowhere, from as little as one or two percent to as much as eight or nine percent, and this is often used as one of the evaluation criteria. But since Google offers web page snapshots, it has almost no dead link problem: even if a site in the search results has shut down, you can still see the copy of the page that Google itself stores. How do you count this kind of dead link?
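The counting question can be made concrete with a short sketch. The `Result` record and the `snapshots_count_as_alive` switch are hypothetical; they only show that the dead link rate depends on whether a stored snapshot counts as a reachable result.

```python
from dataclasses import dataclass


@dataclass
class Result:
    url: str
    live: bool          # does the original page still load?
    has_snapshot: bool  # does the engine keep its own stored copy?


def dead_link_rate(results, snapshots_count_as_alive):
    """Share of results the user cannot read under the chosen counting rule."""
    if not results:
        return 0.0
    dead = 0
    for r in results:
        reachable = r.live or (snapshots_count_as_alive and r.has_snapshot)
        if not reachable:
            dead += 1
    return dead / len(results)


# Made-up sample: two live pages, two closed sites, one of them with a snapshot.
results = [
    Result("a", live=True, has_snapshot=True),
    Result("b", live=False, has_snapshot=True),
    Result("c", live=False, has_snapshot=False),
    Result("d", live=True, has_snapshot=False),
]

strict = dead_link_rate(results, snapshots_count_as_alive=False)  # 2 of 4 dead
lenient = dead_link_rate(results, snapshots_count_as_alive=True)  # 1 of 4 dead
```

The same result list scores 50% or 25% dead links depending on the rule, so an evaluation has to state which rule it used.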
Five: user burden
I have not seen any search engine evaluation in China use this criterion, but it is an important factor in judging search engines, and it covers many aspects. Search engines are for people to use, so they should make searching comfortable, convenient, and quick. Anything that hinders or delays the user in reaching the final search results counts as user burden.
The first is the search interface. A pure search engine interface with nothing but a search box, compared with a portal page full of ads and large amounts of other content, imposes a very different burden on the user.
The second is how the search results are described: whether the description text on the results page is long or short; whether it is taken from the part of the page that contains the keyword, from the beginning of the indexed page, or from a few lines of its main content; whether keywords are highlighted and in what color; whether the page address is displayed; and how the results page is laid out. All of these make a big difference to the user's search burden.
Another factor is the number of steps the user must take: whether a search can be started with the mouse, whether the results page shows only 10 results, whether paging is convenient, whether there are two search boxes or one and whether they sit at the top or the bottom, and whether the keyword is still displayed in the search box after a search. Every one of these affects search efficiency.
Six: others
Whether searching within a directory is supported,
how often the Internet index database is updated,
the stability of the search engine,
and the ability to support advanced search should also be evaluated.
One person cannot think of everything, and there may be other important evaluation criteria that I have not mentioned; if you think of any, please let me know. Having read this far, everyone should understand the limitations of the evaluation methods commonly used for search engines today. Of course, the most ridiculous thing is that, whether out of ignorance, trickery, or some special selection criterion I do not know of, some Chinese search engine evaluations this year did not even include Google, which is like drawing up a long list of famous violinists and leaving out Paganini.
It really is hard to evaluate a search engine.