KL3M is the first clean LLM, focusing on language models without IP or toxicity issues. It's trained on high-quality content with clear provenance, avoiding copyright violations, contract breaches, synthetic data from other LLMs, and toxic sources. KL3M models are efficient, even outperforming larger models on certain tasks. They are already being used for drafting and revising legal documents, SEC filings, and more. KL3M offers several uses, including continued training on your content, fine-tuning for various purposes, and licensing of its training data.
• high-quality content
• no copyright issues
• no toxic sources
• no llm synthetic data
• no breach of contract
• clean provenance
• fairly trained
The first KL3M models have been designed with accessible use as a priority. kl3m-170 runs quickly on a MacBook Air M1, and kl3m-1.7b runs well on a $300 consumer GPU.
Smaller KL3M models are trained using the GPT-NeoX architecture. Larger KL3M models are trained using the Mixtral Mixture-of-Experts architecture (trained from scratch).
KL3M is distributed as standard PyTorch model weights. KL3M architectures are supported for both HuggingFace transformers and vllm for inference.
`kl3m-170m` and `kl3m-1.7b` have both been trained on a predominantly English-language content. Larger models include content in English, Spanish (es-ES and es-MX), French, and German. We are working on adding more languages.
Not yet. Our focus has been on enabling the use of small, local LLMs for information security and accessibility purposes, but we are evaluating the possibility of providing an API in the future.
We have had excellent results fine-tuning KL3M on a number of use cases, including drafting, summarization, and classification. You can fine-tune kl3m-170 and kl3m-1.7b on consumer hardware.
We have collected over 2.5 trillion tokens of training data, and we are constantly adding more. Our training data is a mix of public domain and explicitly licensed content.
`kl3m-170m` and `kl3m-1.7b` have been trained on approximately 350B tokens of primarily English-language content. Larger models are being trained on between 500B to 1T tokens of content in English, Spanish, French, and German.
Not yet. While our pretraining data does include a number of conversational sources, we have not yet trained a model that is designed for standard conversational rounds. Stay tuned.
Our base models already support a number of tasks like extractive/abstractive summarization or conversion, but we have not trained an open-ended model. Our first instruct model supports legal drafting and revision, and we'd love to hear what other tasks you'd like supported.
As the 🍊 suggests, KL3M is pronounced like "Clem" or "Klem."
KL3M was originally short for the Kelvin Legal Large Language Model, KLLLM. Because we're nerds, we shortened all those Ls to L cubed or L3, then shortened K-L3-M to KL3M.
Average Rating: 0.0
5 Stars:
0 Ratings
4 Stars:
0 Ratings
3 Stars:
0 Ratings
2 Stars:
0 Ratings
1 Star:
0 Ratings
No ratings available.
Free AI tool generating high-quality paragraphs in various tones and languages.
View DetailsAI-powered writing tool for creating high-quality content with features like advanced model instructions, text editor, essay generation, and text analysis.
View DetailsBoost your productivity 5x with personalized GPT prompts. Free Chrome extension with unlimited chats, no signup needed.
View DetailsAI-powered paragraph generator offering various writing styles, languages, and customization options for efficient content creation.
View DetailsAnonymous, uncensored AI chat with AES encryption and no logs. Offers free and pro plans.
View DetailsWayin AI summarizes videos, supports multiple languages, and allows interactive Q&A via chatbot and screenshot queries.
View DetailsPokecut is a free AI-powered photo editor with tools for background removal, changing, and enhancement. Pro plans offer extra features and credits.
View DetailsConnect your Github repos to ChatGPT & Claude for code assistance, bug finding, and documentation. Free trial available.
View DetailsCreate and interact with a customizable AI girlfriend. Features include AI chat, roleplay, and image generation. NSFW content available.
View DetailsA trivia website with questions in multiple categories. Play now and expand your knowledge!
View DetailsArbor is an automated carbon accounting platform that helps businesses measure, analyze, and reduce their product's carbon footprint quickly and accurately.
View DetailsPhotoLog offers secure, client-side encrypted media storage with mini-site creation, easy sharing, and various storage plans.
View DetailsAI-powered mobile app testing platform with a test automation cloud (Ptero) and a no-code test scenario authoring tool (Stego).
View DetailsAI-powered productivity assistant for ADHD and knowledge workers, centralizing notes, tasks, and AI tools to enhance focus and efficiency.
View Details