Web search challenges

Notes:


Technical challenges

  • Huge amount of data
  • Frequent updates
  • Unstructured content
  • Un-indexable content (images, binary files, proprietary formats)

Notes:

  • What are the technical challenges of web search?

Quality challenges

Page quality

  • Expertise
  • Authoritativeness
  • Trustworthiness

Notes:

  • Come up with examples of websites that rank high on one of those scales.

Quality challenges

Page quality

  • Spam
  • Duplicate content

Notes:

  • Come up with examples of spam and duplicate content. How could you detect them?
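
One possible answer for duplicate content, as a minimal sketch rather than how any real search engine does it: split each page into overlapping word sequences (shingles) and compare the two sets with Jaccard similarity. The function names, the shingle size `k=3`, and the `0.7` threshold are assumptions picked for illustration.

```python
from typing import Set, Tuple

Shingle = Tuple[str, ...]


def shingles(text: str, k: int = 3) -> Set[Shingle]:
    """Return the set of k-word shingles in text (lowercased, whitespace-split)."""
    words = text.lower().split()
    if len(words) < k:
        # Text shorter than k words: treat the whole text as one shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a: Set[Shingle], b: Set[Shingle]) -> float:
    """Jaccard similarity: size of the intersection over size of the union."""
    if not a and not b:
        return 1.0  # two empty documents are considered identical
    return len(a & b) / len(a | b)


def is_near_duplicate(doc_a: str, doc_b: str,
                      threshold: float = 0.7, k: int = 3) -> bool:
    """Flag two documents as near duplicates (threshold is an assumed value)."""
    return jaccard(shingles(doc_a, k), shingles(doc_b, k)) >= threshold


page_a = "the quick brown fox jumps over the lazy dog"
page_b = "the quick brown fox jumps over the lazy cat"
print(jaccard(shingles(page_a), shingles(page_b)))  # 0.75
print(is_near_duplicate(page_a, page_b))            # True
```

Comparing every pair of pages this way does not scale to the whole web, so the sketch only illustrates the similarity measure itself.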

Quality challenges

Result quality

  • User intent
  • Misspellings
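
For misspellings, one simple approach is to compare each query word against a list of known words using edit distance (Levenshtein distance) and suggest the closest match. The sketch below assumes a tiny hypothetical vocabulary and a maximum distance of 2; both are illustrative choices.

```python
from typing import Iterable, Optional


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum insertions, deletions, substitutions."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def suggest(word: str, vocabulary: Iterable[str],
            max_distance: int = 2) -> Optional[str]:
    """Return the closest vocabulary word within max_distance, else None."""
    candidates = list(vocabulary)
    if not candidates:
        return None
    # Ties are broken alphabetically so the result is deterministic.
    best = min(candidates, key=lambda v: (edit_distance(word, v), v))
    return best if edit_distance(word, best) <= max_distance else None


VOCABULARY = ["search", "speech", "engine"]  # hypothetical word list
print(edit_distance("kitten", "sitting"))    # 3
print(suggest("serach", VOCABULARY))         # search
```

A production spell checker would also weigh how frequent each candidate is in real queries; this sketch only ranks by distance.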

Notes:

  • How could you detect user intent?
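
One starting point, sketched under strong assumptions: queries are often grouped into navigational (reach a specific site), transactional (do or buy something), and informational (learn something) intents, and a few keyword rules can make a first guess. The cue lists and the `guess_intent` function below are hypothetical and far from complete.

```python
import re

# Hypothetical cue words; a real system would learn these from query logs.
NAVIGATIONAL_CUES = {"login", "homepage", "website"}
TRANSACTIONAL_CUES = {"buy", "price", "cheap", "order", "download"}
INFORMATIONAL_CUES = {"how", "what", "why", "who", "when", "tutorial"}

# A query that contains something like "example.org" probably targets a site.
DOMAIN_PATTERN = re.compile(r"\b[\w-]+\.(com|org|net|io)\b")


def guess_intent(query: str) -> str:
    """Guess the intent of a query with simple keyword rules."""
    text = query.lower()
    tokens = set(re.findall(r"[a-z0-9]+", text))
    if DOMAIN_PATTERN.search(text) or tokens & NAVIGATIONAL_CUES:
        return "navigational"
    if tokens & TRANSACTIONAL_CUES:
        return "transactional"
    if tokens & INFORMATIONAL_CUES:
        return "informational"
    return "unknown"  # ambiguous queries need more context


for q in ["facebook login", "buy running shoes",
          "how does pagerank work", "jaguar"]:
    print(q, "->", guess_intent(q))
```

The last query shows the limit of this approach: "jaguar" could mean the animal or the car brand, and the words alone do not say which. Signals such as click behavior, location, or previous queries would be needed to decide.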