
-
MOSH, inc.
- tokyo
- @PENGUINANA_
Starred repositories
🔊 Text-Prompted Generative Audio Model
A simple screen parsing tool towards pure vision based GUI agent
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
Examples and guides for using the Gemini API
Visualizations for machine learning datasets
MARS5 speech model (TTS) from CAMB.AI
Tailored cloud solutions based on use case, cost, and preferences using natural language with Agentic AI to research, design, price, diagram, and report an optimized result.