Transfer knowledge from provisioned domains to Serverless
Setup Amazon OpenSearch Ingestion
To get began, you could have an lively OpenSearch Service area (supply) and OpenSearch Serverless assortment (sink). Full the next steps to arrange an OpenSearch Ingestion pipeline for migration:
- On the OpenSearch Service console, select Pipeline underneath Ingestion within the navigation pane.
- Select Create a pipeline.
- For Pipeline title, enter a reputation (for instance,
octank-migration
).
- For Pipeline capability, you possibly can outline the minimal and most capability to scale up the sources. For now, you possibly can go away the default minimal as 1 and most as 4.
- For Configuration Blueprint, choose
AWS-OpenSearchDataMigrationPipeline
.
- Replace the next info for the supply:
- Uncomment
hosts
and specify the endpoint of the prevailing OpenSearch Service endpoint.
- Uncomment
distribution_version
in case your supply cluster is an OpenSearch Service cluster with compatibility mode enabled; in any other case, go away it commented.
- Uncomment
indices
, embrace
, index_name_regex
, and add an index title or sample that you simply wish to migrate (for instance, octank-iot-logs-2023.11.0*
).
- Replace
area
underneath aws
the place your supply area is (for instance, us-west-2
).
- Replace
sts_role_arn
underneath aws
to the function that has permission to learn knowledge from the OpenSearch Service area (for instance, arn:aws:iam::111122223333:function/osis-pipeline
). This function needs to be added as a backend function inside the OpenSearch Service safety roles.
- Replace the next info for the sink:
- Uncomment
hosts
and specify the endpoint of the prevailing OpenSearch Serverless endpoint.
- Replace
sts_role_arn
underneath aws
to the function that has permission to jot down knowledge into the OpenSearch Serverless assortment (for instance, arn:aws:iam::111122223333:function/osis-pipeline
). This function needs to be added as a part of the information entry coverage within the OpenSearch Serverless assortment.
- Replace the
serverless
flag to be true
.
- For
index
, you possibly can go away it as default, which is able to get the metadata from the supply index and write to the identical title within the vacation spot as of the sources. Alternatively, if you wish to have a unique index title on the vacation spot, modify this worth along with your desired title.
- For
document_id
, you will get the ID from the doc metadata within the supply and use the identical within the goal. Notice that customized doc IDs are supported just for the SEARCH kind of assortment; you probably have your assortment as TIMESERIES or VECTORSEARCH, you need to remark this line.
- Subsequent, you possibly can validate your pipeline to verify the connectivity of supply and sink to ensure the endpoint exists and is accessible.
- For Community settings, select your most popular setting:
- Select VPC entry and choose your VPC, subnet, and safety group to arrange the entry privately.
- Select Public to make use of public entry. AWS recommends that you simply use a VPC endpoint for all manufacturing workloads, however this walkthrough, choose Public.
- For Log Publishing Possibility, you possibly can both create a brand new Amazon CloudWatch group or use an present CloudWatch group to jot down the ingestion logs. This supplies entry to details about errors and warnings raised through the operation, which may also help throughout troubleshooting. For this walkthrough, select Create new group.
- Select Subsequent, and confirm the small print you specified to your pipeline settings.
- Select Create pipeline.
It ought to take a few minutes to create the ingestion pipeline.
The next graphic offers a fast demonstration of making the OpenSearch Ingestion pipeline by way of the previous steps.

Confirm ingested knowledge within the goal OpenSearch Serverless assortment
After the pipeline is created and lively, log in to OpenSearch Dashboards to your OpenSearch Serverless assortment and run the next command to record the indexes:
GET _cat/indices?v
The next graphic offers a fast demonstration of itemizing the indexes earlier than and after the pipeline turns into lively.

Conclusion
On this publish, we noticed how OpenSearch Ingestion can ingest knowledge into an OpenSearch Serverless assortment with out the necessity to use the third-party options. With minimal knowledge producer configuration, it routinely ingested knowledge to the gathering. OSI additionally permits you to remodel or reindex the information from ES7.x model earlier than ingestion to an OpenSearch Service area or OpenSearch Serverless assortment. OSI eliminates the necessity to provision, scale, or handle servers. AWS provides numerous sources so that you can shortly begin constructing pipelines utilizing OpenSearch Ingestion. You should use numerous built-in pipeline integrations to shortly ingest knowledge from Amazon DynamoDB, Amazon Managed Streaming for Apache Kafka (Amazon MSK), Amazon Safety Lake, Fluent Bit, and lots of extra. The next OpenSearch Ingestion blueprints allow you to construct knowledge pipelines with minimal configuration modifications.
Concerning the Authors
Muthu Pitchaimani is a Search Specialist with Amazon OpenSearch Service. He builds large-scale search functions and options. Muthu is within the matters of networking and safety, and is predicated out of Austin, Texas.
Prashant Agrawal is a Sr. Search Specialist Options Architect with Amazon OpenSearch Service. He works intently with clients to assist them migrate their workloads to the cloud and helps present clients fine-tune their clusters to realize higher efficiency and save on value. Earlier than becoming a member of AWS, he helped numerous clients use OpenSearch and Elasticsearch for his or her search and log analytics use instances. When not working, you will discover him touring and exploring new locations. Briefly, he likes doing Eat → Journey → Repeat.
Rahul Sharma is a Technical Account Supervisor at Amazon Internet Companies. He’s passionate in regards to the knowledge applied sciences that assist leverage knowledge as a strategic asset and is predicated out of NY city, New York.