Paths

Table of Contentst

-
lib/shared/
-
shared/
-
radix-tree.js
-
radix-tree.test.js

[lib] Introduce RadixTree
ClosedPublic
Actions

Authored by ashoat on Oct 27 2023, 11:30 AM.

Details

Reviewers

tomek
atul
inka
rohan

Commits

rCOMM755557751d79: [lib] Introduce RadixTree

Summary

We currently use a rather naive approach for prefix search, where we simply add each substring to a HashMap.

This diff introduces a new data structure optimized for prefix search: the radix tree.

After the improvements in the parent diff, this improve the perf of useChatMentionSearchIndex by 35%. Measuring from before the parent diff, the incremental improvement is about 19%. Combined with the parent diff, in total we're improving perf by 64%.

Linear tasks: ENG-5137 and ENG-5480

Depends on D9625

Test Plan

Included unit tests
I tested the chat mentions experience
I did some perf testing:

In combination with the following diff, I used this patch to test performance before and after this change. I made sure I had at least three samples of each scenario. Will also link my messy Gist of results, but it's not really interpretable by anyone other than me.

Here's the relevant portion:

BEFORE

 LOG  useChatMentionSearchIndex took 1801ms
 LOG  useChatMentionSearchIndex took 1748ms
 LOG  useChatMentionSearchIndex took 1730ms
 LOG  useChatMentionSearchIndex took 1831ms

 AVERAGE 1777.5ms

JUST DEDUP (parent diff)

 LOG  useChatMentionSearchIndex took 1027ms
 LOG  useChatMentionSearchIndex took 949ms
 LOG  useChatMentionSearchIndex took 957ms

 AVERAGE 977.7ms

DEDUP + RADIX TREE

 LOG  useChatMentionSearchIndex took 643ms
 LOG  useChatMentionSearchIndex took 629ms
 LOG  useChatMentionSearchIndex took 651ms
 LOG  useChatMentionSearchIndex took 609ms

 AVERAGE 633ms

JUST RADIX TREE

 LOG  useChatMentionSearchIndex took 1394ms
 LOG  useChatMentionSearchIndex took 1468ms
 LOG  useChatMentionSearchIndex took 1511ms
 LOG  useChatMentionSearchIndex took 1492ms
 LOG  useChatMentionSearchIndex took 1397ms

 AVERAGE 1452.4ms

Diff Detail

Repository

rCOMM Comm

Lint

Lint Not Applicable

Unit

Tests Not Applicable

Event Timeline

ashoat created this revision.Oct 27 2023, 11:30 AM

Herald added a subscriber: will. · View Herald TranscriptOct 27 2023, 11:30 AM

ashoat edited the test plan for this revision. (Show Details)Oct 27 2023, 11:31 AM

ashoat added a child revision: D9627: [lib] Use RadixTree in SearchIndex.Oct 27 2023, 11:32 AM

Worth mentioning that I spent like 2-3 hours combing through pretty much every single relevant NPM package before implementing this myself. Keywords I searched for (in various combinations) included "radix tree", "trie", "prefix search", and "patricia".

I found a couple packages that weren't ancient, but mostly they didn't support storing a set of results for a given keywords... just storing the keyword itself. I could've made that work with an additional hashmap, but I didn't really want to bother with needing a second data structure.

There was one package I found that could store a set of results, but it had strange behavior when I tested it.

lib/shared/radix-tree.js
5–14 ↗	(On Diff #32475)	These are purposefully not read-only. These objects never get exposed outside of this class (unless somebody accesses `root` directly), and it improves perf to modify them in-place rather than having to construct new ones every time

Harbormaster completed remote builds in B23550: Diff 32475.Oct 27 2023, 11:53 AM

ashoat requested review of this revision.Oct 27 2023, 11:53 AM

Measuring from before the parent diff, the incremental improvement is about 19%.

I guess there should be also a significant improvement in memory usage

lib/shared/radix-tree.js
41 ↗	(On Diff #32475)	We can avoid this iteration by replacing an array with a map from the first character to a node (for `children` prop. `values` should probably stay as is). This should improve the performance and simplify the code.
49 ↗	(On Diff #32475)	Doesn't matter at all, but it looks safer to first test the length and then the first char
137 ↗	(On Diff #32475)	It doesn't seem necessary to use the stack in this case, as it will always have 0 or 1 element because at most one child of a node can be considered as a possible match. In each `while` iteration, we're removing one item from a stack and adding up to one item - we can replace the stack with a single value.
145 ↗	(On Diff #32475)	This can match at most once per `while` iteration because every `child` from `node.children` starts with a different char.

This revision is now accepted and ready to land.Oct 30 2023, 4:04 AM

@tomek’s nit about ordering about conditions
@tomek’s feedback about not needing a stack for exact match
@tomek’s feedback to use a map from first char for children (instead of an array) to improve perf
To maintain compatibility with old approach, I updated the code to "dedup" identical values using Sets
To reduce unnecessary memory usage, I got rid of leaf: false

I tested performance again and it's about the same as it was in the last revision of the diff

Harbormaster failed remote builds in B23576: Diff 32508!Oct 30 2023, 11:58 AM

Lint fixes

Harbormaster completed remote builds in B23578: Diff 32510.Oct 30 2023, 12:22 PM

Closed by commit rCOMM755557751d79: [lib] Introduce RadixTree (authored by ashoat). · Explain WhyOct 30 2023, 12:24 PM

This revision was automatically updated to reflect the committed changes.

ashoat added a commit: rCOMM755557751d79: [lib] Introduce RadixTree.

ashoat added inline comments.Oct 31 2023, 4:35 AM

lib/shared/radix-tree.js
140	Creating a new set here is wasteful... I was doing it this way before I changed `RadixTreeLeafNode` to store a set instead of an array, but I forgot to change it back afterwards

ashoat added inline comments.Oct 31 2023, 5:17 AM

lib/shared/radix-tree.js
140	Solved in D9647

Revision Contents
Changeset List

Path

Size

lib/

shared/

radix-tree.js

150 lines

radix-tree.test.js

199 lines

Diff 32513

View Options

lib/shared/radix-tree.js

// @flow		// @flow

import * as React from 'react';		import * as React from 'react';
import { Switch, Text, View } from 'react-native';		import { Alert, Switch, Text, View } from 'react-native';
import { ScrollView } from 'react-native-gesture-handler';		import { ScrollView } from 'react-native-gesture-handler';
import { useDispatch } from 'react-redux';		import { useDispatch } from 'react-redux';

		import { getMessageForException } from 'lib/utils/errors.js';
		import { entries } from 'lib/utils/objects.js';

		import { useClientBackup } from '../backup/use-client-backup.js';
		import Button from '../components/button.react.js';
import { setLocalSettingsActionType } from '../redux/action-types.js';		import { setLocalSettingsActionType } from '../redux/action-types.js';
import { useSelector } from '../redux/redux-utils.js';		import { useSelector } from '../redux/redux-utils.js';
import { useStyles } from '../themes/colors.js';		import { useColors, useStyles } from '../themes/colors.js';

// eslint-disable-next-line no-unused-vars		// eslint-disable-next-line no-unused-vars
function BackupMenu(props: { ... }): React.Node {		function BackupMenu(props: { ... }): React.Node {
const styles = useStyles(unboundStyles);		const styles = useStyles(unboundStyles);
const dispatch = useDispatch();		const dispatch = useDispatch();
		const colors = useColors();

		const userStore = useSelector(state => state.userStore);
const isBackupEnabled = useSelector(		const isBackupEnabled = useSelector(
state => state.localSettings.isBackupEnabled,		state => state.localSettings.isBackupEnabled,
);		);

		const { restoreBackupProtocol } = useClientBackup();

		const testRestore = React.useCallback(async () => {
		let message;
		try {
		const result = await restoreBackupProtocol({ userStore });
		message = entries(result)
		.map(([key, value]) => `${key}: ${String(value)}`)
		.join('\n');
		} catch (e) {
		console.error(`Backup uploading error: ${e}`);
		message = `Backup restore error: ${String(getMessageForException(e))}`;
		}
		Alert.alert('Restore protocol result', message);
		}, [restoreBackupProtocol, userStore]);

const onBackupToggled = React.useCallback(		const onBackupToggled = React.useCallback(
value => {		value => {
dispatch({		dispatch({
type: setLocalSettingsActionType,		type: setLocalSettingsActionType,
payload: { isBackupEnabled: value },		payload: { isBackupEnabled: value },
});		});
},		},
[dispatch],		[dispatch],
);		);

return (		return (
<ScrollView		<ScrollView
contentContainerStyle={styles.scrollViewContentContainer}		contentContainerStyle={styles.scrollViewContentContainer}
style={styles.scrollView}		style={styles.scrollView}
>		>
<Text style={styles.header}>SETTINGS</Text>		<Text style={styles.header}>SETTINGS</Text>
<View style={styles.section}>		<View style={styles.section}>
<View style={styles.submenuButton}>		<View style={styles.submenuButton}>
<Text style={styles.submenuText}>Toggle automatic backup</Text>		<Text style={styles.submenuText}>Toggle automatic backup</Text>
<Switch value={isBackupEnabled} onValueChange={onBackupToggled} />		<Switch value={isBackupEnabled} onValueChange={onBackupToggled} />
</View>		</View>
</View>		</View>

		<Text style={styles.header}>ACTIONS</Text>
		<View style={styles.section}>
		<Button
		onPress={testRestore}
		style={styles.row}
		iosFormat="highlight"
		iosHighlightUnderlayColor={colors.panelIosHighlightUnderlay}
		iosActiveOpacity={0.85}
		>
		<Text style={styles.submenuText}>Test backup restore protocol</Text>
		</Button>
		</View>
</ScrollView>		</ScrollView>
);		);
}		}

const unboundStyles = {		const unboundStyles = {
scrollViewContentContainer: {		scrollViewContentContainer: {
paddingTop: 24,		paddingTop: 24,
},		},
Show All 21 Lines	submenuButton: {
paddingVertical: 10,		paddingVertical: 10,
alignItems: 'center',		alignItems: 'center',
},		},
submenuText: {		submenuText: {
color: 'panelForegroundLabel',		color: 'panelForegroundLabel',
flex: 1,		flex: 1,
fontSize: 16,		fontSize: 16,
},		},
		row: {
		flexDirection: 'row',
		justifyContent: 'space-between',
		paddingHorizontal: 24,
		paddingVertical: 14,
		},
};		};

export default BackupMenu;		export default BackupMenu;