Understanding Amazon CloudSearch


What is Amazon CloudSearch


Amazon CloudSearch is a fully managed service in the AWS Cloud that makes it easy to set up, manage, and scale a search solution for your website or application.

Understanding a Search Engine


A search engine makes it possible to search large collections of mostly textual data items (called documents) to quickly find the best matching results. Search requests are usually a few words of unstructured text, such as "matt damon movies". The returned results are usually ranked with the best matching, or most relevant, items listed first (the ones that are most "about" the search words).
Documents may be completely unstructured, or they can contain multiple fields that can optionally be searched individually. For example, a search service for movies might have documents with fields for title, director, actor, description, and reviews. Results returned by a search engine are typically proxies for the underlying documents, such as URLs that reference particular web pages. 
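
As a rough illustration of documents with fields, the movie example above could be expressed as a batch of JSON "add" operations, which is the general shape Amazon CloudSearch accepts for uploads. This is only a minimal sketch: the document IDs, the field names, and the make_add_operation helper are assumptions chosen for illustration, and the fields would need to match the index fields configured for your own search domain.

```python
import json


def make_add_operation(doc_id, fields):
    """Wrap one document's fields in an "add" operation for a document batch."""
    return {"type": "add", "id": doc_id, "fields": fields}


# Hypothetical documents; IDs and field names are illustrative only.
operations = [
    make_add_operation("movie-1", {
        "title": "Good Will Hunting",
        "director": "Gus Van Sant",
        "actor": ["Matt Damon", "Robin Williams"],
    }),
    make_add_operation("movie-2", {
        "title": "The Martian",
        "director": "Ridley Scott",
        "actor": ["Matt Damon", "Jessica Chastain"],
    }),
]

# Serialize the whole batch as a single JSON array.
batch = json.dumps(operations, indent=2)
print(batch)
```

Each entry carries its own ID, so a search result can act as a proxy that points back to the underlying document wherever it is stored.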

CloudSearch Benefits


Amazon CloudSearch provides several benefits over running your own self-managed search service, including easy configuration, automatic scaling for data and traffic, self-healing clusters, and high availability with Multi-AZ. With a few clicks in the AWS Management Console, you can create a search domain and upload the data you want to make searchable. Amazon CloudSearch then automatically provisions the required resources and deploys a highly tuned search index.

Amazon CloudSearch scales automatically with the volume of data and the complexity of search requests to deliver fast and accurate results. It lets customers add search capability without managing hosts, traffic and data scaling, redundancy, or software packages. Users pay hourly rates only for the resources consumed, which can give Amazon CloudSearch a significantly lower total cost of ownership than operating and managing your own search environment.

A search service and a storage service are complementary. A search service requires that your documents already be stored somewhere, whether as files in a file system, objects in Amazon S3, or records in an Amazon DynamoDB table or an Amazon RDS instance. The search service is a rapid retrieval system that makes those items searchable with sub-second latencies through a process called indexing.
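
To give a feel for what indexing means in general, here is a toy inverted index in Python. It is a sketch of the basic idea only, not a description of how Amazon CloudSearch is implemented internally; the sample documents and the build_index and find_all helpers are made up for illustration.

```python
from collections import defaultdict


def build_index(documents):
    """Map each lowercased term to the set of document IDs that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def find_all(index, query):
    """Return the IDs of documents that contain every term in the query."""
    terms = query.lower().split()
    if not terms:
        return set()
    matches = set(index.get(terms[0], set()))
    for term in terms[1:]:
        matches &= index.get(term, set())
    return matches


# Hypothetical stored documents, keyed by ID.
documents = {
    "doc1": "Matt Damon stars in The Martian",
    "doc2": "Ridley Scott directed The Martian",
    "doc3": "Matt Damon and Ben Affleck wrote Good Will Hunting",
}

index = build_index(documents)
print(sorted(find_all(index, "matt damon")))  # ['doc1', 'doc3']
```

The index is built once, ahead of time, so a query only has to look up a few terms instead of scanning every stored document. A real search engine would also rank the matches by relevance, which this sketch leaves out.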

Supported Additional Features


With this latest release, Amazon CloudSearch supports several new search and administration features. The key new features include:
· Language support:
    - 34 languages, plus "multiple" to handle mixed-language fields
    - Per-field language configuration
    - Language-specific text analysis
    - Multiple levels of algorithmic stemming for many languages, including "none"
· Enhanced search features:
    - Suggestions
    - Highlighting
    - Geospatial search
    - New data types: date, double, 64-bit signed int, latlon
    - Sloppy phrase search
    - Term boosting
    - Enhanced range searching for all field types
    - Support for multiple query parsers: simple, structured, lucene, dismax
    - Query parser configuration options
· Administration features:
    - High availability option
    - IAM integration
    - User-configurable scaling
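
To make the query parser choice a little more concrete, the sketch below builds search request URLs that select a parser through the q.parser parameter. Treat it as an assumption-laden illustration: the endpoint hostname is a placeholder, build_search_url is a hypothetical helper, and the title and description fields are assumed to exist in the domain. No request is actually sent.

```python
from urllib.parse import urlencode

# Placeholder endpoint; a real domain has its own search endpoint.
SEARCH_ENDPOINT = "https://search-movies-example.us-east-1.cloudsearch.amazonaws.com"

# The four query parsers named in the feature list above.
PARSERS = {"simple", "structured", "lucene", "dismax"}


def build_search_url(query, parser="simple", size=10, endpoint=SEARCH_ENDPOINT):
    """Build a search URL for the given query string and query parser."""
    if parser not in PARSERS:
        raise ValueError(f"unknown query parser: {parser!r}")
    params = {"q": query, "q.parser": parser, "size": size}
    return f"{endpoint}/2013-01-01/search?{urlencode(params)}"


# Sloppy phrase search with the simple parser: the two words may be
# up to two positions apart.
print(build_search_url('"good hunting"~2'))

# Term boosting with the structured parser: a match in the (assumed)
# title field counts more than a match in the description field.
boosted = "(or (term field=title boost=2 'martian') (term field=description 'martian'))"
print(build_search_url(boosted, parser="structured", size=5))
```

Keeping the parser as an explicit argument makes it easy to compare how the same user input behaves under different parsers before settling on one.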
