Skip to content

Latest commit

 

History

History
81 lines (62 loc) · 5.48 KB

README.md

File metadata and controls

81 lines (62 loc) · 5.48 KB

Hindsight

Look back on your digital experience with Hindsight! (very alpha)


Overview

Hindsight is an android app that takes a screenshot every 2 seconds. The server upload functionality allows a user to upload the screenshots to a server running on their personal computer.

Communication

Join us on Discord

Setup an onboarding session or just chat about the project here

Demos

Setup

Querying

For querying setup you will need to setup the huggingface cli and if you want to use a restricted model, such as llama, you will need to request access on huggingface. You can change the LLM model within hindsight_server/config.py.

Viewing Your Data

Enter the hindsight_server directory by running: cd hindsight_server

  • python views/timeline_view.py: To view the screenshots on a timeline
  • python views/search_view.py: To search the screenshots by text or embedding
  • python views/query_view.py: To query the screenshots using a local LLM

Features

Currently the app has 4 "working" functionalities.

  1. Screen Recording: Toggle screen recording on or off.
  2. Location Tracking: Will save the last retrieved location every 2 seconds (if it has changed). Currently only works if screen recording is also enabled. Also be sure to give location tracking permissions.
  3. Server Upload: Upload screenshots directly to your server.
  4. Query: Run natural language queries against the text in your screenshots. There are 3 types of querying techniques currently available:
    1. Basic (Default or start query with b/): Uses top N contexts retreived by chromadb and feeds them to a single LLM call
    2. Long Context (start query with l/): For each top N contexts grabs the frames immediately before and after. For each context, the frames are combined and fed to an LLM. Finally, all of the results are fed to a summary LLM call to generate the response.
    3. Decomposition (start query with d/): First creates prompt asking LLM to generate prompts to gain context for the query. Then, it run Long Context on each of these sub-prompts. Finally, it combines all of the results of the sub-prompts and feeds it to the LLM with the user query.
    4. All (start query with a/): Runs all of the above methods and passes answers as a prompt asking to chose the best answer and report the method the answer came from.

Developing

You can easily build applications on top of Hindsight. To access the database you can use the functions within hindsight_server/db.py:

from db import HindsightDB

db = HindsightDB()

frames_df = db.get_frames()

search_results = db.search(text="hindsight")

App Settings

  • User Activity Detection: This setting is recommended as, depending on your phone usage, it can signifantly save battery life. The app will only take a screenshot if the user has been active since the last screenshot.
  • Screenshots Per Auto Upload This setting determines how many screenshots taken in a row before the app automatically attempts to upload to the server. (Could add setting to disable)

Server Settings

  • Add Android Application Alias Can add android identifier aliases to the hindsight_servers/res/android_identifers.json file. Note: This file will only be updated each time the server backend is started.

Permissions

  • Screen Broadcasting: The App requires screen broadcasting each time recording is started. Ideally this is only needed if the phone is turned off or if the user intentionally stops recording (pausing is available as well). However, the app is very buggy so it may be required more often.
  • Accessibility Access: In order to use the Only Take Screenshot when User is Active setting and for the screenshot to be saved with the name of the application running, Accessibility access is required. Unfortunetly, this is the only robust way I could find to introduce this functionality.

Contributions

Feedback and contributions are welcome! Please reach out if you have ideas or want to help improve Hindsight.

Considerations

  • Security: Lots of security concerns so would love feedback in that area!
  • Battery Usage: It uses around 2% of my new google pixel8's battery. Hopefully this can be reduced even more, and I'm very curious if battery life will be too big of a limitation for other phones.
  • iPhone: Unfortunetly, this would not work on iPhone as the broadcast functionality would require user permission each time the phone screen turns off.
  • Testing: This has only been tested on a Pixel 8 and M3 Mac