Add a function and an auxiliary 'enum' (created via Object.freeze) for creating unique
device ID in javascript/flow.
Linear issue: https://linear.app/comm/issue/ENG-1279/initialize-deviceid-on-clients
Details
Run provided tests.
Diff Detail
- Repository
- rCOMM Comm
- Branch
- inka/userID
- Lint
No Lint Coverage - Unit
No Test Coverage
Event Timeline
Right, I'll get more familiar with the Notion notes to get things like this right in the future.
Thanks for changing the UUID to ID @inka !
Following my previous comment, I'm pretty sure that the best name for such function is generateDeviceID and not get... because we are not getting it from the storage or from somewhere, we are generating it.
For me, the current function name is pretty confusing.
web/utils/device-uuid.js | ||
---|---|---|
12 ↗ | (On Diff #14774) |
I've re-checked it, it returns a 64 characters string. Sorry about this, the hex is ok. |
15 ↗ | (On Diff #14774) | The last thing is that we have a format like prefix:suffix, please check the comment in ENG-1279. And you need to check if the value from your function is valid in our regex which I've mentioned in the task comment. |
web/utils/device-uuid.js | ||
---|---|---|
15 ↗ | (On Diff #14774) | Right, I'll fix that. Thank you! |
web/utils/device-id.js | ||
---|---|---|
12 ↗ | (On Diff #14832) | Thanks for the changes! Why we've updated the string to the upper case? |
web/utils/device-id.js | ||
---|---|---|
12 ↗ | (On Diff #14832) | The regex mentioned by @max https://github.com/CommE2E/comm/blob/f074a9c13b3ba5c33cbdb9a4c1bde54f0d7330ce/services/tunnelbroker/src/Constants.h#L48 contains 64 chars [a-zA-Z0-9] (so about 62 chars to choose from). It gives 62^64 (about 2^384) possible ids. The solution with 32 random bytes gives only 8^32 = 2^96 possible ids. It is still plenty, but we decrease the number significantly. Could you find a solution where we choose from the whole available space of valid ids? |
web/utils/device-id.js | ||
---|---|---|
12 ↗ | (On Diff #14832) | I've changed to uppercase because on mobile we already had a function creating random hex strings but it produces uppercase hex. Regardless, I'll change that according to Tomek's comment, so it's no longer relevant. |
web/utils/device-id.js | ||
---|---|---|
15–21 | Splitting this string isn't necessary - you can use charAt function of string | |
16–19 | It is a good idea to give every constant a meaningful name - create a const. It is especially important when the same value is used in multiple places. | |
20 | The generated chars aren't uniformly distributed: the number of available symbols doesn't divide number of possible values of byte (62 doesn't divide 256). This causes an issue when using this logic, because some chars would appear more often than others (0 will be used when byte value is 0, 62, 124, 186, and 248, at the same time Z will be used when the value is 61, 123, 185, and 247, so we can expect 0 to be 25% more frequent than Z). Generating secure, uniform string is complicated and it might be a good idea to search for a library that does that. But in our case the uniformity isn't required - the lack of it slightly changes the probability of some ids, but it shouldn't be a threat to our security. I'd probably keep the current implementation, but I'm also curious what @ashoat thinks about it. |
web/utils/device-id.js | ||
---|---|---|
20 | We have code that does it in the same way in native/cpp/CommonCpp/CryptoTools/Tools.cpp (https://github.com/CommE2E/comm/blob/master/native/cpp/CommonCpp/CryptoTools/Tools.cpp#:~:text=Tools%3A%3A-,generateRandomString,-(size_t%20size%2C) |
web/utils/device-id.js | ||
---|---|---|
31 ↗ | (On Diff #14959) | It might be a good idea to return null here, or even throw - we should never expect this code to be executed. Probably returning null is better, because the code that calls this function can then decide what to do with the result - if throwing is a good solution. Returning an empty string isn't ideal solution because the client should be warned that this function might fail. If we simply return empty string, it can get accepted and used without checking if it's empty - for optional string an explicit action needs to be performed. |
Adding @ashoat because we need to decide if having non-uniform distribution of ids is good enough for us.
I think we should find a way to generate a uniform distribution of IDs here... hopefully it isn't too hard. I think this is a good idea:
Generating secure, uniform string is complicated and it might be a good idea to search for a library that does that.
If this ends up being too difficult, I'm open to landing this. But would be great to do a little bit of research first to see if we can find something.
web/utils/device-id.js | ||
---|---|---|
27 ↗ | (On Diff #15264) | What do you think about making it consistent with generateRandomString by adding an invariant at the beginning of the function? |
web/utils/device-id.test.js | ||
10 ↗ | (On Diff #15264) | Maybe use deviceIDTypes.KEYSERVER instead? |
13 ↗ | (On Diff #15264) | Using https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Template_literals could make this more readable |
web/utils/device-id.js | ||
---|---|---|
27 ↗ | (On Diff #15264) | We should avoid throwing raw strings, please throw Errors |
web/utils/device-id.js | ||
---|---|---|
27 ↗ | (On Diff #15264) | Since it is not possible anyway for type to not be one of deviceIDTypes.KEYSERVER, deviceIDTypes.WEB, deviceIDTypes.MOBILE (as it is of type DeviceIDType), and the throw was only there so that flow wouldn't show "Cannot expect string as the return type of function because string [1] is incompatible with implicitly-returned undefined." error, I'll just change the last else if to an else. |
This looks great and we're really close to finishing it!
web/utils/device-id.js | ||
---|---|---|
20–26 ↗ | (On Diff #15301) | We're using false so that an invariant would always be violated. We want that because, in every valid scenario, we should return before it. |
27 ↗ | (On Diff #15264) | The issue with this approach is that introducing new DeviceIDType becomes a lot harder. This function won't warn a programmer that a new value should be handled in a different way and instead would happily return mobile... for it. |
web/utils/device-id.test.js | ||
---|---|---|
28 ↗ | (On Diff #15308) | It seems like we're repeating the same regex 3 times. It's a good point to create one and reuse it. |
web/utils/device-id.js | ||
---|---|---|
30 ↗ | (On Diff #15308) | I think since the function accepts DeviceIDType, and as such accepts only values that actually are present in the deviceIDTypes, reaching this point means that the function doesn't handle all types declared in deviceIDTypes. So it actually makes more sense to say "Unhandled" - it's an information for the developer that the function has to be corrected, to handle all declared types. |
web/utils/device-id.js | ||
---|---|---|
30 ↗ | (On Diff #15308) |
Ok, lets use the original one. |