A lambda written in Rust that consumes a dynamodb stream for identity-users and indexes users into the identity-search OpenSearch instance.
Depends on D9877
Paths
| Differential D9936 Authored by will on Nov 20 2023, 6:36 PM.
Details Summary A lambda written in Rust that consumes a dynamodb stream for identity-users and indexes users into the identity-search OpenSearch instance. Depends on D9877 Test Plan Successfully built using cargo lambda and deployed with terraform. Confirmed manually the lambda is syncing the stream.
Diff Detail
Event TimelineHerald added subscribers: tomek, ashoat. · View Herald TranscriptNov 20 2023, 6:36 PM2023-11-20 18:36:28 (UTC-8) will retitled this revision from [services] Rust opensearch indexing lambda to [services] [4/n] Rust opensearch indexing lambda.Nov 20 2023, 6:38 PM2023-11-20 18:38:08 (UTC-8) will edited the summary of this revision. (Show Details)Nov 20 2023, 6:40 PM2023-11-20 18:40:59 (UTC-8) will added a parent revision: D9877: [terraform] [2/n] enable streams for DynamoDB identity-users table. will added a child revision: D9878: [terraform] [4/n] search_index_lambda terraform configuration options. Harbormaster returned this revision to the author for changes because remote builds failed.Nov 20 2023, 6:53 PM2023-11-20 18:53:45 (UTC-8) will retitled this revision from [services] [4/n] Rust opensearch indexing lambda to [services] [3/n] Rust opensearch indexing lambda.Nov 21 2023, 12:15 AM2023-11-21 00:15:30 (UTC-8) Harbormaster completed remote builds in B24321: Diff 33477.Nov 21 2023, 8:09 PM2023-11-21 20:09:13 (UTC-8) bartek added inline comments.
This revision now requires changes to proceed.Nov 23 2023, 3:26 AM2023-11-23 03:26:42 (UTC-8) Comment Actions I tried my hand at using remote derive to implement the Deserialization trait. However, many of the structs are non-exhaustive, preventing usage of constructors outside of the crate. This means I'm unable to implement structs with the remote attribute. Tried looking but I don't know if there's any solutions to this. Instead, I just opted to writing my own structs. Comment Actions
Harbormaster completed remote builds in B24611: Diff 33999.Nov 29 2023, 11:05 AM2023-11-29 11:05:31 (UTC-8) Harbormaster completed remote builds in B24610: Diff 33998.Nov 29 2023, 11:17 AM2023-11-29 11:17:23 (UTC-8) Harbormaster completed remote builds in B24613: Diff 34001.Nov 29 2023, 11:24 AM2023-11-29 11:24:29 (UTC-8) Comment Actions Thanks for addressing these, good job! I'll let @varun take a look
That's unfortunate but yeah, writing own structs is a reasonable solution. You can optionally add a comment that they correspond to DynamoDB Streams types. Comment Actions what happens if we hit an expect and panic? wondering if we need to handle errors more gracefully
This revision now requires changes to proceed.Nov 30 2023, 3:58 PM2023-11-30 15:58:10 (UTC-8) Comment Actions
Since lambda is "serverless", I assume it's one of two scenarios:
I have a feeling that introducing custom error types is kinda overkill here, the codebase is rather small. Do you think that anyhow context would fit better here? Then we could map anyhow::Error to lambda_runtime::Error in main(). In fact this is just Box<dyn Error>.
Comment Actions
A single error from the lambda can block further processing of other entries for up to a day by default. We can configure this behavior in our terraform configuration. There are a few options:
I think we could definitely use anyhow Harbormaster completed remote builds in B25143: Diff 34753.Dec 15 2023, 6:11 PM2023-12-15 18:11:44 (UTC-8) Harbormaster completed remote builds in B25144: Diff 34754.Dec 15 2023, 6:31 PM2023-12-15 18:31:01 (UTC-8) Comment Actions Not sure why we've converted expect()'s to unwrap()'s.
is there a task tracking this?
This revision now requires changes to proceed.Dec 18 2023, 11:08 AM2023-12-18 11:08:43 (UTC-8) Comment Actions
When we use anyhow and Result instead of panicking, this doesn't matter. Returning Err from main() doesn't kill the whole instance (batch?), just fails a single invocation. Comment Actions
I've addressed part of this in [4/n] with: resource "aws_lambda_function_event_invoke_config" "example" { function_name = aws_lambda_function.search_index_lambda.function_name maximum_event_age_in_seconds = 60 maximum_retry_attempts = 2 } When running prior batches, I was encountering problems with it running with too many retry attempts. Here's an issue I modified with the current context: Harbormaster completed remote builds in B25191: Diff 34808.Dec 18 2023, 11:51 AM2023-12-18 11:51:32 (UTC-8) This revision is now accepted and ready to land.Dec 29 2023, 5:23 AM2023-12-29 05:23:39 (UTC-8) Harbormaster completed remote builds in B25558: Diff 35323.Jan 5 2024, 3:55 PM2024-01-05 15:55:28 (UTC-8) Closed by commit rCOMM82f1b24f9264: [services] [3/n] Rust opensearch indexing lambda (authored by will). · Explain WhyJan 16 2024, 4:08 PM2024-01-16 16:08:23 (UTC-8) This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 34754 .gitignore
services/search-index-lambda/Cargo.lock
services/search-index-lambda/Cargo.toml
services/search-index-lambda/src/constants.rs
services/search-index-lambda/src/main.rs
services/tunnelbroker/Cargo.lock
|