我要翻译啊,急啊,帮帮忙啊,好急的哦

来源:百度知道 编辑:UC知道 时间:2024/05/17 22:27:46
1.2. Google: Scaling with the Web
Creating a search engine which scales even to today's web presents many challenges. Fast crawling technology is needed to gather the web documents and keep them up to date. Storage space must be used efficiently to store indices and, optionally, the documents themselves. The indexing system must process hundreds of gigabytes of data efficiently. Queries must be handled quickly, at a rate of hundreds to thousands per second.
These tasks are becoming increasingly difficult as the Web grows. However, hardware performance and cost have improved dramatically to partially offset the difficulty. There are, however, several notable exceptions to this progress such as disk seek time and operating system robustness. In designing Google, we have considered both the rate of growth of the Web and technological changes. Google is designed to scale well to extremely large data sets. It makes efficient use of storage space to store the index. Its data s

1.2.Google: 按比例缩放与 网络创建的缩放甚至可以今天的网络搜索引擎提供了许多挑战。 快速搜网技术需要收集 Web 文档并使它们保持最新。 必须有效地使用存储空间来存储索引或文档本身。 索引系统必须有效地处理数百亿兆的数据。 查询必须迅速,达到处理数百个到千位每秒的速度。
在网上这些索引任务变得越来越困难。 但硬件性能和成本已显著向部分偏移困难改进。 但是有几个明显的例外情况在此计划外(如磁盘搜索时间和操作系统可靠性。) 在设计 Google时,我们已经考虑到网络和技术变革的增长的速率。 Google 旨在为扩展到更大的数据集。 它使存储空间来存储使索引有效地使用。 其数据结构是经过优化快速和有效的访问 (请参阅 4.2部分)。 进一步,我们预计的成本来编制索引和存储文本或 HTML 最终将拒绝相对可用的量 (请参阅附录 B)。 这将导致成为集中式系统(如 Google 属性大为缩放)。