Anahata Yam
"Yet Another Module" - The creative and experimental laboratory for multimodal agentic tools.
Yam provides specialized toolkits that extend the ASI's reach into external environments, from web automation to hardware interaction.
Chrome Automation
The Chrome toolkit leverages Selenium WebDriver to give the ASI full "eyes and hands" on the web. It is engineered for secure, debug-mode interactions with an existing browser profile.
Visual Reasoning
The ASI can take screenshots of any page or specific element, enabling it to "see" UI layouts and debug front-end issues visually.
Form Metabolism
Surgical form-filling capabilities allow the model to navigate complex multi-step workflows or automate repetitive data entry tasks.
The Connection Loop
- Automatically detects running Chrome instances.
- Restarts Chrome in
--remote-debugging-portmode if necessary. - Maintains session persistence via a dedicated "Anahata" profile.
Internet Radio
The Radio toolkit provides a direct audio feed for the developer's "Metabolic Flow." Integrated with the SomaFM API and high-quality streams like KEXP and FIP Paris.
Station Curation
Deeply integrated SomaFM channels (Groove Salad, Drone Zone, Lush) provided as native system instructions.
Device Targeting
Synchronous hardware line selection ensures audio is routed to the correct output device (HDMI, Headphones, System) without OS intervention.
Speech Synthesis (TTS)
Leveraging the FreeTTS engine, the Speech toolkit allows the ASI to communicate via high-salience audio alerts or full text-to-speech synthesis.
Designed for "Eyes-Free" debugging and status notifications, the toolkit provides immediate feedback on long-running tasks or critical errors using the speak(text) tool.
The Experimental Lab
Yam is the birthplace of all new multimodal capabilities. From Android device control (ADB integration) to advanced image processing, it serves as the incubation chamber for the next generation of Anahata tools.