Modal is a serverless cloud platform designed for AI, ML, and data applications. Built for developers, it offers high-performance infrastructure to run generative AI models, large-scale batch jobs, job queues, and more. Key features include instant code changes and rebuilds, scalability to hundreds of GPUs, fast cold boots, and seamless autoscaling. Modal supports various use cases, such as generative AI inference, fine-tuning, batch processing, and computational biology. The platform prioritizes developer experience and offers flexible environments, seamless integrations, data storage solutions, job scheduling, web endpoints, and built-in debugging tools. Pricing is usage-based, paying only for compute time.
• seamless integrations
• data storage
• built-in debugging
• web endpoints
• job scheduling
• flexible environments
Billable time is the time your code is actively using compute resources. Idle containers do not incur charges.
CPU usage is metered per physical core, per second. Memory usage is metered per GiB, per second.
A container using 0.5 cores for 1 minute will be billed for 0.5 cores * 60 seconds * $0.000038/core/sec = $0.0114.
A container using 1 H100 GPU for 10 seconds will be billed for 10 seconds * $0.001267/sec = $0.01267.
No, the Starter plan is limited to three users. Please upgrade to a Team plan for more users.
Modal can run various applications, including LLM APIs, image generation, Parquet file analysis, and LLM-powered bots.
Contact our sales team for an enterprise quote. They'll help determine the best plan for your organization's needs and scale.
No, Modal currently does not charge for storage.
• $30 / month free credits
• 3 workspace seats included
• 100 containers + 10 GPU concurrency
• Crons and web endpoints (limited)
• Real-time metrics and logs
• $100 / month free credits
• Unlimited seats
• 1000 containers + 30 GPU concurrency
• Unlimited crons and web endpoints
• Custom domains
• Region selection
• Static IP proxy
• Deployment rollbacks
• Custom monthly free compute
• Unlimited seats
• Custom GPU concurrency
• Support via private Slack
• Personalized integration help
• Audit logs, Okta SSO, and HIPAA
Forward Deployed Engineer
Modal: High-performance serverless cloud for AI and ML applications. Pay-per-use pricing, easy scaling, and developer-focused.
Benefits:
Full medical, dental, vision insurance
Competitive salary and equity
Experience Requirements:
Experience working with AI applications
At least a few years professional software engineering experience
Other Requirements:
Willing to work in-person in New York City (SF also an option for very strong candidates)
Responsibilities:
Help our customers architect and build complex AI applications
Optimize performance for open-source models and frameworks
Write examples and build demos that showcase Modal
Contribute to the core Modal stack
Help our community build cool stuff on top of Modal
Show more details
Member of Technical Staff - ML Performance
Modal: High-performance serverless cloud for AI and ML applications. Pay-per-use pricing, easy scaling, and developer-focused.
Benefits:
Full medical, dental, vision insurance
Competitive salary and equity
Experience Requirements:
5+ years of experience writing high-quality, high-performance code
Experience working with torch, high-level ML frameworks, and inference engines (vLLM or TensorRT)
Familiarity with Nvidia GPU architecture and CUDA
Experience with ML performance engineering
Other Requirements:
Work in-person, in our NYC, San Francisco or Stockholm office
Show more details
Member of Technical Staff - Product (Frontend)
Modal: High-performance serverless cloud for AI and ML applications. Pay-per-use pricing, easy scaling, and developer-focused.
Benefits:
Full medical, dental, vision insurance
Competitive salary and equity
Experience Requirements:
2+ years of full-time software engineering experience
Experience building applications with a modern front-end Javascript framework such as React
Ability to build pixel-perfect components and polished interactions
Strong product sense and experience driving product outcomes
Strong communication skills and a desire to partner with our customers in solving their problems
Other Requirements:
Work in-person, in either our NYC or Stockholm office
Ability to partner closely with product design to craft delightful user experiences
Show more details
Average Rating: 0.0
5 Stars:
0 Ratings
4 Stars:
0 Ratings
3 Stars:
0 Ratings
2 Stars:
0 Ratings
1 Star:
0 Ratings
No ratings available.
Syslogic provides ruggedized embedded computers and AI edge devices powered by NVIDIA and Intel technologies for demanding applications.
View DetailsRun AI functions on-device to drastically reduce costs. Pay only $0.01 per unique device download.
View DetailsCerebras Systems designs and builds wafer-scale AI supercomputers for faster deep learning training and inference, offering open-source models and cloud services.
View DetailsAnyscale is a fully-managed compute platform for Ray, simplifying AI/ML development and deployment at scale, from laptops to data centers. It offers features like RayTurbo, optimized for performance and cost efficiency.
View DetailsUP Bridge the Gap offers single-board computers (SBCs) and AI solutions for professional developers, supporting projects from development to mass production.
View DetailsAnonymous, uncensored AI chat with AES encryption and no logs. Offers free and pro plans.
View DetailsWayin AI summarizes videos, supports multiple languages, and allows interactive Q&A via chatbot and screenshot queries.
View DetailsPokecut is a free AI-powered photo editor with tools for background removal, changing, and enhancement. Pro plans offer extra features and credits.
View DetailsConnect your Github repos to ChatGPT & Claude for code assistance, bug finding, and documentation. Free trial available.
View DetailsCreate and interact with a customizable AI girlfriend. Features include AI chat, roleplay, and image generation. NSFW content available.
View DetailsA trivia website with questions in multiple categories. Play now and expand your knowledge!
View DetailsArbor is an automated carbon accounting platform that helps businesses measure, analyze, and reduce their product's carbon footprint quickly and accurately.
View DetailsPhotoLog offers secure, client-side encrypted media storage with mini-site creation, easy sharing, and various storage plans.
View DetailsAI-powered mobile app testing platform with a test automation cloud (Ptero) and a no-code test scenario authoring tool (Stego).
View DetailsAI-powered productivity assistant for ADHD and knowledge workers, centralizing notes, tasks, and AI tools to enhance focus and efficiency.
View Details