Comments (8)
Should I try adding Linux here
self-operating-computer/operate/main.py
Line 546 in bc3c5fd
and see what the result is ?
Edit - Because I dont think Key down is as common as windows key...
from self-operating-computer.
@shubhexists This is a known issue. For now, the search command is tailored for Mac OS. Restricting your prompt to reference UI elements which are already on-screen is a work-around. Having a set way to open a launcher/menu via key shortcuts is going to be more difficult in Linux as there isn't one set way to open the launcher/menu like in Mac OS.
One possible way to resolve this issue on GNU/Linux could be to simply prompt the user for their keyboard shortcut to open a launcher/menu/search.
from self-operating-computer.
Hi I solved it and now it is searching applications using the windows key...
If you agree, I can make a PR adding this to the main repository?
Edit - It seems brave has a diff position for search bar than chrome : ) That required manual intermission
Demo
Screencast.2023-12-01.20.58.07.mp4
from self-operating-computer.
Windows Key + type should work in most Linux distros (+ in Windows). I will test it later and maybe also create a PR with a fix for Ubuntu.
from self-operating-computer.
I guess I agree that windows key + type might be an option that may cover a much wider range of devices
from self-operating-computer.
@shubhexists Yeah this is a good default. It personally isn't the key I use to open the menu (I have a keyboard intended for Macs, so I have command/alt open the menu), so I'm probably biased against it. But you are right, this is certainly most common for Linux.
from self-operating-computer.
@shubhexists That looks great! This should be a good PR.
from self-operating-computer.
Merged the PR. Thanks for monitoring @michaelhhogue and adding the code @shubhexists
from self-operating-computer.
Related Issues (20)
- [FEATURE] No update instructions?
- [BUG] WINDOWS install not finding gpt-4-with-ocr HOT 5
- [BUG] Unable to activate the virtual environment
- [BUG] Not running on Ubuntu 22.04.4 LTS HOT 3
- CogVLM Support - A better LLaVa
- [BUG] -m gemini-pro-vision asking for OPENAI_API_KEY HOT 2
- [FEATURE] Add Remote Ollama Capability
- [BUG] Cannot seem to select the right emails to delete.
- [FEATURE] Learning Process HOT 1
- [FEATURE] GUI Interface and further connectivity
- [BUG] operate -m llava return error local variable 'content' referenced before assignment
- [BUG] ModuleNotFoundError: No module named 'pkg_resources' HOT 4
- [BUG] Need GPT-4 ? HOT 1
- [FEATURE] Azure open AI support HOT 2
- OpenSource free Vision model use Instead of openAI HOT 5
- Github
- [Linux]: X get_image failed: error 8 (73, 0, 1316) [Error] --> cannot access local variable 'content' where it is not associated with a value HOT 2
- [For be deleted]
- [BUG] No such file or directory Xauthority
- [BUG] Brief Description of the Issue
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from self-operating-computer.