[services][backup] PullBackup 5/5 - blob download stream
ClosedPublic

Authored by bartek on Jan 11 2023, 3:48 PM.

Details

Reviewers

tomek
varun
jon
michal

Commits

rCOMM357c69fe954e: [services][backup] PullBackup 5/5 - blob download stream

Summary

This diff uses both ResponseBuffer and BlobStoredItem to create a generic stream that downloads an item from the blob service, buffers the data, and sends it in chunks that do not exceed the gRPC message size limit. A few notes:

The first message includes "additional extra info", i.e. fields like attachment holders. The remaining messages carry only the ID and a data chunk.
The data is buffered before sending; this is solution 2 from https://phab.comm.dev/D4439#126984
No more data is downloaded while the buffer is already sufficiently full.
The stream ends when there is no more data.
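
To illustrate the first note, here is a minimal sketch of how the room left for data could be computed per message. It is not the code from this diff: `Item`, `MAX_MESSAGE_SIZE`, and `split_into_messages` are hypothetical names, the limit is tiny for readability, and `metadata_size` only borrows the name suggested in the inline comments.

```rust
/// Hypothetical limit, tiny for illustration; the real gRPC limit is much larger.
const MAX_MESSAGE_SIZE: usize = 16;

/// Hypothetical stand-in for a stored item and its per-message metadata.
struct Item {
    id: String,
    /// Stand-in for the extra fields (e.g. attachment holders) sent only once.
    extra_info: String,
}

impl Item {
    /// Bytes of non-data payload a message carries.
    /// Only the first message includes the extra info; all messages include the ID.
    fn metadata_size(&self, is_first_chunk: bool) -> usize {
        let mut size = self.id.len();
        if is_first_chunk {
            size += self.extra_info.len();
        }
        size
    }
}

/// Splits `data` so that metadata plus data never exceeds the message limit.
fn split_into_messages(item: &Item, data: &[u8]) -> Vec<Vec<u8>> {
    let mut chunks = Vec::new();
    let mut offset = 0;
    let mut is_first_chunk = true;
    while offset < data.len() {
        let padding = item.metadata_size(is_first_chunk);
        let room = MAX_MESSAGE_SIZE.saturating_sub(padding);
        assert!(room > 0, "metadata alone exceeds the message size limit");
        let end = (offset + room).min(data.len());
        chunks.push(data[offset..end].to_vec());
        offset = end;
        is_first_chunk = false;
    }
    chunks
}
```

Under these assumptions the first data chunk is smaller than the later ones, because the first message also has to fit the extra info.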

The order of instructions is important here:

1. Push new data to the buffer if it is not full
2. Finish if the buffer is empty
3. Pop a chunk if the buffer is not empty
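
The ordering above can be sketched as a simple synchronous loop. This is only an illustration, not the handler from this diff: `stream_chunks`, `BUFFER_CAPACITY`, and `CHUNK_SIZE` are hypothetical, the blob download stream is replaced by an in-memory queue, and the source is assumed to yield non-empty pieces.

```rust
use std::collections::VecDeque;

/// Hypothetical sizes, tiny for illustration.
const BUFFER_CAPACITY: usize = 8;
const CHUNK_SIZE: usize = 5;

/// Drains `source` (a stand-in for the blob download stream) into chunks
/// of at most `CHUNK_SIZE` bytes, following the three-step order.
fn stream_chunks(mut source: VecDeque<Vec<u8>>) -> Vec<Vec<u8>> {
    let mut buffer: VecDeque<u8> = VecDeque::new();
    let mut sent = Vec::new();
    loop {
        // 1. Push new data if the buffer is not full enough.
        if buffer.len() < BUFFER_CAPACITY {
            if let Some(data) = source.pop_front() {
                buffer.extend(data);
            }
        }
        // 2. Finish if the buffer is empty. This check must come after the
        //    push: the buffer starts empty, so checking first would end the
        //    stream before any data was downloaded.
        if buffer.is_empty() {
            break;
        }
        // 3. Pop a chunk, since the buffer is known to be non-empty here.
        let n = CHUNK_SIZE.min(buffer.len());
        sent.push(buffer.drain(..n).collect());
    }
    sent
}
```

In this sketch each iteration does at most one write and one read, which matches the "one write + one read per iteration" balance described in the inline comments.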

Depends on D6243

Test Plan

At this point, the whole service can be tested with integration tests:

cd services
yarn reset-local-cloud
nix run .#comm-blob
RUST_LOG=backup=trace cargo run -- --port 50052 --sandbox --blob-service-url "http://localhost:50051"
yarn run-integration-tests backup

Diff Detail

Repository

rCOMM Comm

Lint

Lint Not Applicable

Unit

Tests Not Applicable

Event Timeline

bartek created this revision. Jan 11 2023, 3:48 PM

bartek held this revision as a draft.

Herald added subscribers: atul, ashoat. Jan 11 2023, 3:48 PM

bartek added a child revision: D6247: [services][backup] PullBackup - fix tracing span. Jan 11 2023, 3:54 PM

bartek published this revision for review. Jan 11 2023, 3:57 PM

Harbormaster completed remote builds in B15274: Diff 20830. Jan 11 2023, 4:02 PM

Looks great!

services/backup/src/service/handlers/pull_backup.rs
128 ↗	(On Diff #20830)	This API looks unintuitive: it seems like we're taking `extra_bytes` bytes from the buffer. Can we modify the API to be less confusing?
135 ↗	(On Diff #20830)	We could make this slightly more efficient by holding off on sending until the buffer is saturated or the client has been wholly read. But I don't think it is worth it.

This revision is now accepted and ready to land. Jan 12 2023, 9:32 AM

bartek added inline comments. Jan 12 2023, 11:40 PM

services/backup/src/service/handlers/pull_backup.rs
128 ↗	(On Diff #20830)	Maybe `let padding = item.metadata_size(is_first_chunk); let chunk = buffer.get_chunk(padding);`?
135 ↗	(On Diff #20830)	I don't think so. Response data chunks are slightly smaller than those written to the buffer, so it gets saturated/desaturated nearly every time, and one write + one read per iteration is a good balance. Batching would block either the blob service or the backup client while each waits for the other to finish sending/receiving.

bartek added inline comments. Jan 12 2023, 11:54 PM

services/backup/src/service/handlers/pull_backup.rs
128 ↗	(On Diff #20830)	Another idea

Minor renames to make the API more intuitive

Harbormaster failed remote builds in B15346: Diff 20926! Jan 13 2023, 4:00 AM

bartek added a child revision: D6262: [services][backup] Use Rust service in Dockerfile. Jan 13 2023, 12:06 PM

tomek accepted this revision. Jan 16 2023, 8:34 AM

tomek added inline comments.

services/backup/src/service/handlers/pull_backup.rs
128 ↗	(On Diff #20830)	`metadata_size` is a much better name, thanks! As for `get_with_padding`, it is still a bit confusing: someone might think that padding is added to the result (so size + padding bytes are returned in total). So for me it's better to keep it as is.

Rebase

Harbormaster completed remote builds in B15476: Diff 21098. Jan 19 2023, 10:03 AM

Closed by commit rCOMM357c69fe954e: [services][backup] PullBackup 5/5 - blob download stream (authored by bartek). Jan 19 2023, 10:08 AM

This revision was automatically updated to reflect the committed changes.

bartek added a commit: rCOMM357c69fe954e: [services][backup] PullBackup 5/5 - blob download stream.

Revision Contents
Changeset List

Path	Size
services/backup/src/service/handlers/pull_backup.rs	64 lines

Diff 21114


services/backup/src/service/handlers/pull_backup.rs

[638 lines hidden]
   return {
     cookieInsertedThisRequest: true,
     isScriptViewer: false,
   };
 }

 type UserCookieCreationParams = {
   platformDetails: PlatformDetails,
   deviceToken?: ?string,
+  primaryIdentityPublicKey?: ?string,
 };

 // The result of this function should never be passed directly to the Viewer
 // constructor. Instead, it should be passed to viewer.setNewCookie. There are
 // several fields on UserViewerData that are not set by this function:
 // sessionID, sessionIdentifierType, cookieSource, and ipAddress. These
 // parameters all depend on the initial request. If the result of this function
 // is passed to the Viewer constructor directly, the resultant Viewer object
 // will throw whenever anybody attempts to access the relevant properties.
 async function createNewUserCookie(
   userID: string,
   params: UserCookieCreationParams,
 ): Promise<UserViewerData> {
-  const { platformDetails, deviceToken } = params;
+  const { platformDetails, deviceToken, primaryIdentityPublicKey } = params;
   const { platform, ...versions } = platformDetails || defaultPlatformDetails;
   const versionsString =
     Object.keys(versions).length > 0 ? JSON.stringify(versions) : null;

   const time = Date.now();
   const cookiePassword = crypto.randomBytes(32).toString('hex');
   const cookieHash = bcrypt.hashSync(cookiePassword);
   const [[cookieID]] = await Promise.all([
     createIDs('cookies', 1),
     deviceToken ? clearDeviceToken(deviceToken) : undefined,
   ]);

   const cookieRow = [
     cookieID,
     cookieHash,
     userID,
     platform,
     time,
     time,
     deviceToken,
     versionsString,
+    primaryIdentityPublicKey,
   ];
   const query = SQL`
     INSERT INTO cookies(id, hash, user, platform, creation_time, last_used,
-      device_token, versions)
+      device_token, versions, public_key)
     VALUES ${[cookieRow]}
   `;
   await dbQuery(query);
   return {
     loggedIn: true,
     id: userID,
     platformDetails,
     deviceToken,
[142 lines hidden]
