From 31b4806b86f61fed537fae0ff59608086b8eabb8 Mon Sep 17 00:00:00 2001 From: Yuan-Man <68322456+Yuan-ManX@users.noreply.github.com> Date: Fri, 30 Aug 2024 10:07:32 +0800 Subject: [PATCH] Update index.md --- index.md | 14 ++++++++++++++ 1 file changed, 14 insertions(+) diff --git a/index.md b/index.md index 8218949..bc533a7 100644 --- a/index.md +++ b/index.md @@ -36,6 +36,7 @@ Here we will keep track of the latest AI Game Development Tools, including LLM, | [AgentGPT](https://github.com/reworkd/AgentGPT) | 🤖 Assemble, configure, and deploy autonomous AI Agents in your browser. | | | Tool | | [AICommand](https://github.com/keijiro/AICommand) | ChatGPT integration with Unity Editor. | | Unity | Tool | | [AIOS](https://github.com/agiresearch/AIOS) | LLM Agent Operating System. | | | Tool | +| [AI Scientist](https://github.com/SakanaAI/AI-Scientist) | The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery. |[arXiv](https://arxiv.org/abs/2408.06292) | | Tool | | [Assistant CLI](https://github.com/diciaup/assistant-cli) | A comfortable CLI tool to use ChatGPT service🔥 | | | Tool | | [Auto-GPT](https://github.com/Significant-Gravitas/Auto-GPT) | An experimental open-source attempt to make GPT-4 fully autonomous. | | | Tool | | [BabyAGI](https://github.com/yoheinakajima/babyagi) | This Python script is an example of an AI-powered task management system. | | | Tool | @@ -96,6 +97,7 @@ Here we will keep track of the latest AI Game Development Tools, including LLM, | [LLMUnity](https://github.com/undreamai/LLMUnity) | Create characters in Unity with LLMs! | | Unity | Tool | | [LLocalSearch](https://github.com/nilsherzig/LLocalSearch) | LLocalSearch is a completely locally running search engine using LLM Agents. | | | Tool | | [LogicGamesSolver](https://github.com/fabridigua/LogicGamesSolver) | A Python tool to solve logic games with AI, Deep Learning and Computer Vision. | | | Tool | +| [LongWriter](https://github.com/THUDM/LongWriter) | LongWriter: Unleashing 10,000+ Word Generation From Long Context LLMs. |[arXiv](https://arxiv.org/abs/2408.07055) | | Tool | | [Large World Model (LWM)](https://github.com/LargeWorldModel/LWM) | Large World Model (LWM) is a general-purpose large-context multimodal autoregressive model. |[arXiv](https://arxiv.org/abs/2402.08268) | | Tool | | [Lumina-T2X](https://github.com/Alpha-VLLM/Lumina-T2X) | Lumina-T2X is a unified framework for Text to Any Modality Generation. |[arXiv](https://arxiv.org/abs/2405.05945) | | Tool | | [MetaGPT](https://github.com/geekan/MetaGPT) | The Multi-Agent Framework | | | Tool | @@ -156,6 +158,7 @@ Here we will keep track of the latest AI Game Development Tools, including LLM, | :------------------------------------------------------------------------------------------ | :--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | :-----------: | :-----------: | :-------: | | [AgentBench](https://github.com/thudm/agentbench) | A Comprehensive Benchmark to Evaluate LLMs as Agents. |[arXiv](https://arxiv.org/abs/2308.03688) | | Agent | | [Agent Group Chat](https://github.com/MikeGu721/AgentGroup) | An Interactive Group Chat Simulacra For Better Eliciting Collective Emergent Behavior. |[arXiv](https://arxiv.org/abs/2403.13433) | | Agent | +| [Agent K](https://github.com/mikekelly/AgentK) | An autoagentic AGI that is self-evolving and modular. | | | Agent | | [AgentScope](https://github.com/modelscope/agentscope) | Start building LLM-empowered multi-agent applications in an easier way. |[arXiv](https://arxiv.org/abs/2402.14034) | | Agent | | [AgentSims](https://github.com/py499372727/AgentSims/) | An Open-Source Sandbox for Large Language Model Evaluation. | | | Agent | | [AI Town](https://github.com/a16z-infra/ai-town) | AI Town is a virtual town where AI characters live, chat and socialize. | | | Agent | @@ -182,6 +185,7 @@ Here we will keep track of the latest AI Game Development Tools, including LLM, | [FastGPT](https://github.com/labring/FastGPT) | FastGPT is a knowledge-based platform built on the LLM. | | | Agent | | [fastRAG](https://github.com/IntelLabs/fastRAG) | Efficient Retrieval Augmentation and Generation Framework. | | | Agent | | [GameAISDK](https://github.com/Tencent/GameAISDK) | Image-based game AI automation framework. | | | Framework | +| [GameNGen](https://gamengen.github.io/) | Diffusion Models Are Real-Time Game Engines. |[arXiv](https://arxiv.org/abs/2408.14837) | | Game | | [Generative Agents](https://github.com/joonspk-research/generative_agents) | Interactive Simulacra of Human Behavior. |[arXiv](https://arxiv.org/abs/2304.03442) | | Agent | | [Genie](https://sites.google.com/view/genie-2024/home) | Generative Interactive Environments. | | | Game | | [gigax](https://github.com/GigaxGames/gigax) | Runtime, LLM-powered NPCs. | | | Game | @@ -197,6 +201,7 @@ Here we will keep track of the latest AI Game Development Tools, including LLM, | [LlamaIndex](https://github.com/run-llama/llama_index) | LlamaIndex is a data framework for your LLM application. | | | Agent | | [MindSearch](https://github.com/InternLM/MindSearch) | 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT). | | | Agent | | [Mixture of Agents (MoA)](https://github.com/togethercomputer/MoA) | Mixture-of-Agents Enhances Large Language Model Capabilities. |[arXiv](https://arxiv.org/abs/2406.04692) | | Agent | +| [MMRole](https://github.com/YanqiDai/MMRole) | MMRole: A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents. |[arXiv](https://arxiv.org/abs/2408.04203v1) | | Agent | | [Moonlander.ai](https://www.moonlander.ai/) | Start building 3D games without any coding using generative AI. | | | Framework | | [MuG Diffusion](https://github.com/Keytoyze/Mug-Diffusion) | MuG Diffusion is a charting AI for rhythm games based on Stable Diffusion (one of the most powerful AIGC models) with a large modification to incorporate audio waves. | | | Game | | [OmAgent](https://github.com/om-ai-lab/OmAgent) | A multimodal agent framework for solving complex tasks. | | | Agent | @@ -205,10 +210,13 @@ Here we will keep track of the latest AI Game Development Tools, including LLM, | [Pipecat](https://github.com/pipecat-ai/pipecat) | Open Source framework for voice and multimodal conversational AI. | | | Agent | | [Qwen-Agent](https://github.com/QwenLM/Qwen-Agent) | Qwen-Agent is a framework for developing LLM applications based on the instruction following, tool usage, planning, and memory capabilities of Qwen. | | | Agent | | [Ragas](https://github.com/explodinggradients/ragas) | Ragas is a framework that helps you evaluate your Retrieval Augmented Generation (RAG) pipelines. | | | Agent | +| [RPBench-Auto](https://github.com/boson-ai/RPBench-Auto) | An automated pipeline for evaluating LLMs for role-playing. | | | Game | | [SIMA](https://deepmind.google/discover/blog/sima-generalist-ai-agent-for-3d-virtual-environments/) | A generalist AI agent for 3D virtual environments. | | | Agent | | [StoryGames.ai](https://storygames.buildbox.com/) | AI for Dreamers Make Games. | | | Game | | [SWE-agent](https://github.com/princeton-nlp/SWE-agent) | Agent Computer Interfaces Enable Software Engineering Language Models. |[arXiv](https://arxiv.org/abs/2405.15793) | | Agent | +| [TaskGen](https://github.com/simbianai/taskgen) | A Task-based agentic framework building on StrictJSON outputs by LLM agents. | | | Agent | | [Translation Agent](https://github.com/andrewyng/translation-agent) | Agentic translation using reflection workflow. | | | Agent | +| [Twitter](https://github.com/wordware-ai/twitter) | Twitter Personality is a web application that analyzes your Twitter handle to create a personalized personality profile using Wordware AI Agent. | | | Agent | | [Video2Game](https://github.com/video2game/video2game) | Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video. |[arXiv](https://arxiv.org/abs/2404.09833) | | Game | | [V-IRL](https://virl-platform.github.io/) | Grounding Virtual Intelligence in Real Life. |[arXiv](https://arxiv.org/abs/2402.03310) | | Agent | | [WebDesignAgent](https://github.com/DAMO-NLP-SG/WebDesignAgent) | An agent used for webdesign. | | | Agent | @@ -234,6 +242,7 @@ Here we will keep track of the latest AI Game Development Tools, including LLM, | [CodeTF](https://github.com/salesforce/codetf) | One-stop Transformer Library for State-of-the-art Code LLM. | | | Code | | [CodeT5](https://github.com/salesforce/codet5) | Open Code LLMs for Code Understanding and Generation. | | | Code | | [Cursor](https://www.cursor.so/) | Write, edit, and chat about your code with GPT-4 in a new type of editor. | | | Code | +| [DeepSeek Coder](https://github.com/deepseek-ai/DeepSeek-Coder) | DeepSeek Coder: Let the Code Write Itself. |[arXiv](https://arxiv.org/abs/2401.14196) | | Code | | [OpenAI Codex](https://openai.com/blog/openai-codex) | OpenAI Codex is a descendant of GPT-3. | | | Code | | [PandasAI](https://github.com/gventuri/pandas-ai) | Pandas AI is a Python library that integrates generative artificial intelligence capabilities into Pandas, making dataframes conversational. | | | Code | | [RobloxScripterAI](https://www.haddock.ai/search?platform=Roblox) | RobloxScripterAI is an AI-powered code generation tool for Roblox. | | Roblox | Code | @@ -307,6 +316,7 @@ Here we will keep track of the latest AI Game Development Tools, including LLM, | [LayerDiffusion](https://github.com/layerdiffusion/LayerDiffusion) | Transparent Image Layer Diffusion using Latent Transparency. |[arXiv](https://arxiv.org/abs/2305.18676) | | Image | | [Lexica](https://lexica.art/) | A Stable Diffusion prompts search engine. | | | Image | | [LlamaGen](https://github.com/FoundationVision/LlamaGen) | Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation. |[arXiv](https://arxiv.org/abs/2406.06525) | | Image | +| [Lumina-mGPT](https://github.com/Alpha-VLLM/Lumina-mGPT) | Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining. |[arXiv](https://arxiv.org/abs/2408.02657) | | Image | | [MetaShoot](https://metashoot.vinzi.xyz/) | MetaShoot is a digital twin of a photo studio, developed as a plugin for Unreal Engine that gives any creator the ability to produce highly realistic renders in the easiest and quickest way. | | Unreal Engine | Image | | [Midjourney](https://www.midjourney.com/) | Midjourney is an independent research lab exploring new mediums of thought and expanding the imaginative powers of the human species. | | | Image | | [MIGC](https://github.com/limuloo/MIGC) | MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis. |[arXiv](https://arxiv.org/abs/2402.05408) | | Image | @@ -533,6 +543,7 @@ Here we will keep track of the latest AI Game Development Tools, including LLM, | [Kangaroo](https://github.com/KangarooGroup/Kangaroo) | Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input. | | | Visual | | [LGVI](https://jianzongwu.github.io/projects/rovi/) | Towards Language-Driven Video Inpainting via Multimodal Large Language Models. | | | Visual | | [LLaVA++](https://github.com/mbzuai-oryx/LLaVA-pp) | Extending Visual Capabilities with LLaMA-3 and Phi-3. | | | Visual | +| [LLaVA-OneVision](https://github.com/LLaVA-VL/LLaVA-NeXT) | LLaVA-OneVision: Easy Visual Task Transfer. |[arXiv](https://arxiv.org/abs/2408.03326) | | Visual | | [LongVA](https://github.com/EvolvingLMMs-Lab/LongVA) | Long Context Transfer from Language to Vision. |[arXiv](https://arxiv.org/abs/2406.16852) | | Visual | | [MaskViT](https://maskedvit.github.io/) | Masked Visual Pre-Training for Video Prediction. |[arXiv](https://arxiv.org/abs/2206.11894) | | Visual | | [MiniCPM-Llama3-V 2.5](https://github.com/OpenBMB/MiniCPM-V) | A GPT-4V Level MLLM on Your Phone. | | | Visual | @@ -540,6 +551,7 @@ Here we will keep track of the latest AI Game Development Tools, including LLM, | [MotionLLM](https://github.com/IDEA-Research/MotionLLM) | Understanding Human Behaviors from Human Motions and Videos. |[arXiv](https://arxiv.org/abs/2405.20340) | | Visual | | [PLLaVA](https://github.com/magic-research/PLLaVA) | Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning. |[arXiv](https://arxiv.org/abs/2404.16994) | | Visual | | [Qwen-VL](https://github.com/QwenLM/Qwen-VL) | A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond. |[arXiv](https://arxiv.org/abs/2308.12966) | | Visual | +| [Sapiens](https://github.com/facebookresearch/sapiens) | Sapiens: Foundation for Human Vision Models. |[arXiv](https://arxiv.org/abs/2408.12569) | | Visual | | [ShareGPT4V](https://github.com/ShareGPT4Omni/ShareGPT4V) | Improving Large Multi-modal Models with Better Captions. |[arXiv](https://arxiv.org/abs/2311.12793) | | Visual | | [SOLO](https://github.com/Yangyi-Chen/SOLO) | SOLO: A Single Transformer for Scalable Vision-Language Modeling. |[arXiv](https://arxiv.org/abs/2407.06438) | | Visual | | [Video-CCAM](https://github.com/QQ-MM/Video-CCAM) | Video-CCAM: Advancing Video-Language Understanding with Causal Cross-Attention Masks. | | | Visual | @@ -612,6 +624,7 @@ Here we will keep track of the latest AI Game Development Tools, including LLM, | [Moonvalley](https://moonvalley.ai/) | Moonvalley is a groundbreaking new text-to-video generative AI model. | | | Video | | [Mora](https://github.com/lichao-sun/Mora) | More like Sora for Generalist Video Generation. |[arXiv](https://arxiv.org/abs/2403.13248) | | Video | | [Morph Studio](https://www.morphstudio.com/) | With our Text-to-Video AI Magic, manifest your creativity through your prompt. | | | Video | +| [MotionClone](https://github.com/Bujiazi/MotionClone) | MotionClone: Training-Free Motion Cloning for Controllable Video Generation. |[arXiv](https://arxiv.org/abs/2406.05338) | | Video | | [MotionCtrl](https://wzhouxiff.github.io/projects/MotionCtrl/) | A Unified and Flexible Motion Controller for Video Generation. |[arXiv](https://arxiv.org/abs/2312.03641) | | Video | | [MotionDirector](https://github.com/showlab/MotionDirector) | Motion Customization of Text-to-Video Diffusion Models. |[arXiv](https://arxiv.org/abs/2310.08465) | | Video | | [Motionshop](https://aigc3d.github.io/motionshop/) | An application of replacing the characters in video with 3D avatars. | | | Video | @@ -778,6 +791,7 @@ Here we will keep track of the latest AI Game Development Tools, including LLM, | [Stable Speech](https://github.com/sanchit-gandhi/stable-speech) | Stability AI's Text-to-Speech model. | | | Speech | | [StableTTS](https://github.com/KdaiP/StableTTS) | Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3. | | | Speech | | [StyleTTS 2](https://github.com/yl4579/StyleTTS2) | Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models. | [arXiv](https://arxiv.org/abs/2306.07691) | | Speech | +| [tortoise.cpp](https://github.com/balisujohn/tortoise.cpp) | tortoise.cpp: GGML implementation of tortoise-tts. | | | Speech | | [TorToiSe-TTS](https://github.com/neonbjb/tortoise-tts) | A multi-voice TTS system trained with an emphasis on quality. | | | Speech | | [TTS Generation WebUI](https://github.com/rsxdalv/tts-generation-webui) | TTS Generation WebUI (Bark, MusicGen, Tortoise, RVC, Vocos, Demucs). | | | Speech | | [VALL-E](https://valle-demo.github.io/) | Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers. | [arXiv](https://arxiv.org/abs/2301.02111) | | Speech |