xrem
Cross-platform in-progress implementation of rem
Way too early to use- seriously, it's not useful yet.
But I want to help develop it!
Oh! That'd be awesome. Just run:
npm run tauri dev -- --release
Does a lot worse in debug mode in terms of performance.
"implemented" (read: likely terrible)
Currently implements, in a parallel / non-blocking way:
- click the status icon and choose to start / stop recording
- screenshot capture every 2 seconds
- OCR at capture time
- calculate text embedding (rem doesn't have this yet lol, and this is commented out anyway for now, but it works)
- stream to mp4 without writing pngs to disk
- basic tray icon + menu
- efficient timeline seeking of a recorded data (with front-end)
- view and "search" history as thumbnails: i put it in quotes because search is not working well yet
- navigate to timeline frame by clicking search result
NOTE:
- NO CACHING YET (this is vital for fast seeking between video files, currently big delay when swapping chunks)
(more recent screenshot of tray - still using default tauri logo)
Recent "search" (recent items) functionality
https://github.com/jasonjmcghee/xrem/assets/1522149/4c8dbff9-4a85-47d1-a0a7-51059f624659
More recent timeline seeking demo
https://github.com/jasonjmcghee/xrem/assets/1522149/4d551500-c905-453c-b35b-83ca969c5159
First Demo of basic poc:
Taking screenshots + ocr transcript printed to terminal, screenshots streamed to videos and saved, then manually showing this by scrubbing.
https://github.com/jasonjmcghee/xrem/assets/1522149/bbf7903a-77ae-4540-85c5-9430c05355fc
It can keep up on my M1 Air, haven't tested elsewhere yet...
BUT, this is using cross-platform rust libraries for the functionality we need.
FAQ
- when will it be ready?
- idk, but with your help it might go faster
- why is the current ocr solution screaming about ARNs?
- idk, if it was AWS's textract it might make more sense
- not having live text analysis (MacOS) seems like selecting from past screenshots will be much harder to build
- yeah
not implemented at all
- Live OCR
- overlay transparent text on image so as to be selectable (potentially works on Mac due to webview, but won't work elsewhere)
- better search UI (matched text - not working well, date, application?)
- natural language search
- settings
- window-specific OCR / filtering