Releases: LazyAGI/LazyLLM
v0.3.0
v0.2.5
v0.2.4
Cherry-pick web assistant to dev/0.2 (#259) Co-authored-by: dorren002 <[email protected]> Co-authored-by: wenduren <[email protected]>
v0.2.3
v0.2.2
- Refactored RAG to make it more user-friendly and extensible, allowing flexible customization.
- Added support for multimodal capabilities, including text-to-image, image-text understanding, speech-to-text, and text-to-speech/music.
- Refactored the microservice architecture to support the streaming output of results from any module.
- Added support for Function Call and SQL Call, as well as high-level agents like React, ReWOO, and PlanAndSolve.
- Introduced a new CookBook document and provided bilingual API documentation in both Chinese and English.
- 重构了RAG,使其更加易用,且具备更强的扩展性,方便大家灵活定制
- 支持了多模态的能力,包括文生图、图文理解、语音转文字和文字转语音/音乐
- 重构了微服务架构,支持了任意模块的结果的流式输出
- 支持了Function Call和Sql Call,支持了React、ReWOO、PlanAndSolve等高阶Agent
- 新增了CookBook文档,支持了中英双语的API文档
v0.1.2
LazyLLM v0.1 released!
Features
Convenient AI Application Assembly Process: Even if you are not familiar with large models, you can still easily assemble AI applications with multiple agents using our built-in data flow and functional modules, just like Lego building.
One-Click Deployment of Complex Applications: We offer the capability to deploy all modules with a single click. Specifically, during the POC (Proof of Concept) phase, LazyLLM simplifies the deployment process of multi-agent applications through a lightweight gateway mechanism, solving the problem of sequentially starting each submodule service (such as LLM, Embedding, etc.) and configuring URLs, making the entire process smoother and more efficient. In the application release phase, LazyLLM provides the ability to package images with one click, making it easy to utilize Kubernetes' gateway, load balancing, and fault tolerance capabilities.
Cross-Platform Compatibility: Switch IaaS platforms with one click without modifying code, compatible with bare-metal servers, development machines, Slurm clusters, public clouds, etc. This allows developed applications to be seamlessly migrated to other IaaS platforms, greatly reducing the workload of code modification.
Support for Grid Search Parameter Optimization: Automatically try different base models, retrieval strategies, and fine-tuning parameters based on user configurations to evaluate and optimize applications. This makes hyperparameter tuning efficient without requiring extensive intrusive modifications to application code, helping users quickly find the best configuration.
Efficient Model Fine-Tuning: Support fine-tuning models within applications to continuously improve application performance. Automatically select the best fine-tuning framework and model splitting strategy based on the fine-tuning scenario. This not only simplifies the maintenance of model iterations but also allows algorithm researchers to focus more on algorithm and data iteration, without handling tedious engineering tasks.
What can LazyLLM do
- Application Building: Defines workflows such as pipeline, parallel, diverter, if, switch, and loop. Developers can quickly build multi-agent AI applications based on any functions and modules. Supports one-click deployment for assembled multi-agent applications, and also supports partial or complete updates to the applications.
- Platform-independent: Consistent user experience across different computing platforms. Currently compatible with various platforms such as bare metal, Slurm, SenseCore, etc.
- Supports fine-tuning and inference for large models:
- Offline (local) model services:
- Supports fine-tuning frameworks: collie, peft
- Supports inference frameworks: lightllm, vllm
- Supports automatically selecting the most suitable framework and model parameters (such as micro-bs, tp, zero, etc.) based on user scenarios..
- Online services:
- Supports fine-tuning services: GPT, SenseNova, Tongyi Qianwen
- Supports inference services: GPT, SenseNova, Kimi, Zhipu, Tongyi Qianwen
- Supports embedding inference services: OpenAI, SenseNova, GLM, Tongyi Qianwen
- Support developers to use local services and online services uniformly.
- Offline (local) model services:
- Supports common RAG (Retrieval-Augmented Generation) components: Document, Parser, Retriever, Reranker, etc.
- Supports basic webs: such as chat interface and document management interface, etc.