Docs update to indicate use of conda-merge to generate install files #1387

Merged
2 changes: 1 addition & 1 deletion ci/conda/recipes/morpheus/meta.yaml
Original file line number Diff line number Diff line change
@@ -82,7 +82,7 @@ outputs:
- libwebp>=1.3.2 # Required for CVE mitigation: https://nvd.nist.gov/vuln/detail/CVE-2023-4863
- mlflow>=2.2.1,<3
- mrc
- networkx 3.1.*
- networkx>=2.8
- numpydoc 1.4.*
- nvtabular {{ rapids_version }}.*
- pandas 1.3.*
1 change: 1 addition & 0 deletions docker/conda/environments/cuda11.8_dev.yml
Original file line number Diff line number Diff line change
@@ -21,6 +21,7 @@ channels:
- nvidia/label/dev # For pre-releases of MRC. Should still default to full releases if available
- pytorch
- conda-forge
- defaults
dependencies:
####### Morpheus Dependencies (keep sorted!) #######
- automake=1.16.5
8 changes: 5 additions & 3 deletions docker/conda/environments/cuda11.8_examples.yml
Original file line number Diff line number Diff line change
@@ -16,15 +16,17 @@
# Additional dependencies needed by some of the Morpheus examples.
# The intended usage is to first create the conda environment from the `cuda11.8_dev.yml` file, and then update the
# env with this file. ex:
# mamba env create -n morpheus --file docker/conda/environments/cuda11.8_dev.yml
# conda activate morpheus
# mamba env update -n morpheus --file docker/conda/environments/cuda11.8_examples.yml
# mamba install -n base -c conda-forge conda-merge
# conda run -n base --live-stream conda-merge docker/conda/environments/cuda${CUDA_VER}_dev.yml \
# docker/conda/environments/cuda${CUDA_VER}_examples.yml > .tmp/merged.yml \
# && mamba env update -n morpheus --file .tmp/merged.yml
channels:
- rapidsai
- nvidia
- huggingface
- conda-forge
- dglteam/label/cu118
- defaults
dependencies:
- arxiv=1.4
- boto3
1 change: 1 addition & 0 deletions docker/conda/environments/cuda11.8_runtime.yml
Original file line number Diff line number Diff line change
@@ -19,6 +19,7 @@ channels:
- nvidia
- rapidsai-nightly
- conda-forge
- defaults
dependencies:
- nb_conda_kernels
- pip
6 changes: 5 additions & 1 deletion docs/README.md
Original file line number Diff line number Diff line change
@@ -22,7 +22,11 @@ Additional packages required for building the documentation are defined in `./co
## Install Additional Dependencies
From the root of the Morpheus repo:
```bash
mamba env update -f docs/conda_docs.yml
export CUDA_VER=11.8
mamba install -n base -c conda-forge conda-merge
conda run -n base --live-stream conda-merge docker/conda/environments/cuda${CUDA_VER}_dev.yml \
docs/conda_docs.yml > .tmp/merged.yml \
&& mamba env update -n ${CONDA_DEFAULT_ENV} --file .tmp/merged.yml
```
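As a rough illustration of what the merge step produces, the sketch below combines the `channels` and `dependencies` lists of two environment specifications with a simple order-preserving union. It is a simplified stand-in and not `conda-merge`'s actual algorithm; the function name `merge_envs` is hypothetical, and the two inputs are abbreviated, illustrative specs rather than complete environment files.

```python
def merge_envs(*envs):
    """Order-preserving union of the channels and dependencies of several env specs.

    Simplified illustration only: nested entries such as a `pip:` sub-list and
    conflicting version pins are not handled.
    """
    merged = {"channels": [], "dependencies": []}
    for env in envs:
        for key, items in merged.items():
            for item in env.get(key, []):
                if item not in items:
                    items.append(item)
    return merged


# Abbreviated, illustrative environment specs (normally loaded from YAML files).
agents = {
    "channels": ["huggingface", "conda-forge", "defaults"],
    "dependencies": ["langchain=0.0.190", "pip"],
}
completion = {
    "channels": ["conda-forge", "defaults"],
    "dependencies": ["arxiv=1.4", "langchain=0.0.190"],
}

merged = merge_envs(agents, completion)
print(merged["channels"])      # ['huggingface', 'conda-forge', 'defaults']
print(merged["dependencies"])  # ['langchain=0.0.190', 'pip', 'arxiv=1.4']
```

Each channel and package appears once in the result, which is why a single merged file can be passed to `mamba env update` instead of applying the files one after another.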

## Build Morpheus and Documentation
5 changes: 4 additions & 1 deletion examples/digital_fingerprinting/production/Dockerfile
Original file line number Diff line number Diff line change
@@ -31,7 +31,10 @@ COPY ./conda_env.yml ./

# Install DFP dependencies
RUN source activate morpheus \
&& mamba env update -n morpheus -f ./conda_env.yml
&& mamba install -n base -c conda-forge conda-merge \
&& conda run -n base --live-stream conda-merge /workspace/docker/conda/environments/cuda11.8_dev.yml \
./conda_env.yml > ./merged.yml \
&& mamba env update -n morpheus --file ./merged.yml

# Set the tracking URI for mlflow
ENV MLFLOW_TRACKING_URI="http://mlflow:5000"
Original file line number Diff line number Diff line change
@@ -19,51 +19,55 @@
### Set up Morpheus Dev Container

If you don't already have the Morpheus Dev container, run the following to build it:
```
```bash
./docker/build_container_dev.sh
```

Now run the container:
```
```bash
./docker/run_container_dev.sh
```

Note that Morpheus containers are tagged by date. By default, `run_container_dev.sh` will try to use the current date as the tag. Therefore, if you are trying to run a container that was not built on the current date, you must set the `DOCKER_IMAGE_TAG` environment variable. For example:
```
```bash
DOCKER_IMAGE_TAG=dev-221003 ./docker/run_container_dev.sh
```
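The tag in the example above appears to follow a `dev-YYMMDD` pattern. Assuming that format (it is inferred from the example, not confirmed here), the tag for a given build date could be computed as in this sketch; `dev_image_tag` is a hypothetical helper, not part of the Morpheus scripts.

```python
from datetime import date


def dev_image_tag(build_date: date) -> str:
    """Format a date as a dev-YYMMDD tag (format assumed from the example)."""
    return build_date.strftime("dev-%y%m%d")


# A container built on 3 October 2022 would carry this tag under that assumption.
print(dev_image_tag(date(2022, 10, 3)))  # dev-221003
```

The resulting string is what would be assigned to `DOCKER_IMAGE_TAG` when running a container built on an earlier day.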

In the `/workspace` directory of the container, run the following to compile Morpheus:
```
```bash
./scripts/compile.sh
```

Now install Morpheus:
```
```bash
pip install -e /workspace
```

Install additional required dependencies:
```
```bash
export CUDA_VER=11.8
mamba env update -n morpheus --file docker/conda/environments/cuda${CUDA_VER}_examples.yml
mamba install -n base -c conda-forge conda-merge
conda run -n base --live-stream conda-merge docker/conda/environments/cuda${CUDA_VER}_dev.yml \
docker/conda/environments/cuda${CUDA_VER}_examples.yml > .tmp/merged.yml \
&& mamba env update -n ${CONDA_DEFAULT_ENV} --file .tmp/merged.yml
```


Fetch input data for benchmarks:
```
```bash
./examples/digital_fingerprinting/fetch_example_data.py all
```

### Start MLflow

MLflow is used as the model repository where the trained DFP models will be published and used for inference by the pipelines. Run the following to start MLflow in a host terminal window (not in the container):

```
```bash
# from root of Morpheus repo
cd examples/digital_fingerprinting/production
```

```
```bash
docker compose up mlflow
```

6 changes: 5 additions & 1 deletion examples/gnn_fraud_detection_pipeline/README.md
Original file line number Diff line number Diff line change
@@ -21,7 +21,11 @@ limitations under the License.
Prior to running the GNN fraud detection pipeline, additional requirements must be installed into your Conda environment. A supplemental requirements file has been provided in this example directory.

```bash
mamba env update -n ${CONDA_DEFAULT_ENV} -f examples/gnn_fraud_detection_pipeline/requirements.yml
export CUDA_VER=11.8
mamba install -n base -c conda-forge conda-merge
conda run -n base --live-stream conda-merge docker/conda/environments/cuda${CUDA_VER}_dev.yml \
examples/gnn_fraud_detection_pipeline/requirements.yml > .tmp/merged.yml \
&& mamba env update -n ${CONDA_DEFAULT_ENV} --file .tmp/merged.yml
```

## Running
1 change: 1 addition & 0 deletions examples/gnn_fraud_detection_pipeline/requirements.yml
Original file line number Diff line number Diff line change
@@ -18,6 +18,7 @@ channels:
- nvidia
- conda-forge
- dglteam/label/cu118
- defaults
dependencies:
- cuml=23.06
- dgl=1.0.2
7 changes: 6 additions & 1 deletion examples/llm/agents/README.md
Original file line number Diff line number Diff line change
@@ -95,9 +95,14 @@ export SERPAPI_API_KEY="<YOUR_SERPAPI_API_KEY>"
Install the required dependencies.

```bash
mamba env update -n morpheus --file ${MORPHEUS_ROOT}/docker/conda/environments/cuda11.8_examples.yml
export CUDA_VER=11.8
mamba install -n base -c conda-forge conda-merge
conda run -n base --live-stream conda-merge docker/conda/environments/cuda${CUDA_VER}_dev.yml \
docker/conda/environments/cuda${CUDA_VER}_examples.yml > .tmp/merged.yml \
&& mamba env update -n ${CONDA_DEFAULT_ENV} --file .tmp/merged.yml
```


### Running the Morpheus Pipeline

The top level entrypoint to each of the LLM example pipelines is `examples/llm/main.py`. This script accepts a set
1 change: 1 addition & 0 deletions examples/llm/agents/requirements.yaml
Original file line number Diff line number Diff line change
@@ -16,6 +16,7 @@
channels:
- huggingface
- conda-forge
- defaults
dependencies:
- langchain=0.0.190
- pip
13 changes: 9 additions & 4 deletions examples/llm/completion/README.md
Original file line number Diff line number Diff line change
@@ -35,9 +35,11 @@ limitations under the License.
The primary goal of this example is to showcase the creation of a pipeline that integrates an LLM service with Morpheus. Although this example features a single implementation, the pipeline and its components are versatile and can be adapted to various scenarios with unique requirements. The following highlights different customization points within the pipeline and the specific choices made for this example:

#### LLM Service

- The pipeline is designed to support any LLM service that adheres to our LLMService interface. Compatible services include OpenAI, NeMo, or even local execution using llama-cpp-python. In this demonstration, we focus on utilizing NeMo as the LLM service, highlighting the advantages it offers over other LLM services and the seamless integration with the NeMo ecosystem. Furthermore, the pipeline can accommodate more complex configurations using NeMo + Inform without necessitating changes to the core pipeline.

#### Downstream Tasks

- Post LLM execution, the model's output can be leveraged for various tasks, including model training, analysis, or simulating an attack. In this particular example, we have simplified the implementation and focused solely on the LLMEngine.

### Pipeline Implementation
@@ -64,9 +66,14 @@ Before running the pipeline, ensure that the `NGC_API_KEY` environment variable
Install the required dependencies.

```bash
mamba env update -n morpheus --file ${MORPHEUS_ROOT}/docker/conda/environments/cuda11.8_examples.yml
export CUDA_VER=11.8
mamba install -n base -c conda-forge conda-merge
conda run -n base --live-stream conda-merge docker/conda/environments/cuda${CUDA_VER}_dev.yml \
docker/conda/environments/cuda${CUDA_VER}_examples.yml > .tmp/merged.yml \
&& mamba env update -n ${CONDA_DEFAULT_ENV} --file .tmp/merged.yml
```


#### Setting up NGC API Key

For this example, we utilize the NeMo Service within NGC. To gain access, an NGC API key is required. Follow the
@@ -75,7 +82,6 @@ generate your NGC API key.

Configure the following environment variables, with NGC_ORG_ID being optional:


```bash
export NGC_API_KEY=<YOUR_API_KEY>
export NGC_ORG_ID=<YOUR_NGC_ORG_ID>
@@ -105,7 +111,7 @@ python examples/llm/main.py completion [OPTIONS] COMMAND [ARGS]...

- `--pipeline_batch_size INTEGER RANGE`
- **Description**: Internal batch size for the pipeline. Can be much larger than the model batch size.
Also used for Kafka consumers.
Also used for Kafka consumers.
- **Default**: `1024`

- `--model_max_batch_size INTEGER RANGE`
@@ -123,7 +129,6 @@ python examples/llm/main.py completion [OPTIONS] COMMAND [ARGS]...
- `--help`
- **Description**: Show the help message with options and commands details.


### Running Morpheus Pipeline with OpenAI LLM service

```bash
1 change: 1 addition & 0 deletions examples/llm/completion/requirements.yaml
Original file line number Diff line number Diff line change
@@ -15,6 +15,7 @@

channels:
- conda-forge
- defaults
dependencies:
- arxiv=1.4
- langchain=0.0.190
1 change: 1 addition & 0 deletions examples/llm/rag/requirements.yaml
Original file line number Diff line number Diff line change
@@ -16,6 +16,7 @@
channels:
- huggingface
- conda-forge
- defaults
dependencies:
- pip
- openai=0.28
1 change: 1 addition & 0 deletions examples/llm/vdb_upload/requirements.yaml
Original file line number Diff line number Diff line change
@@ -15,6 +15,7 @@

channels:
- conda-forge
- defaults
dependencies:
- arxiv=1.4
- onnx # required for triton model export
Original file line number Diff line number Diff line change
@@ -17,6 +17,7 @@ channels:
- rapidsai
- nvidia
- conda-forge
- defaults
dependencies:
- cuml=23.06
- jupyterlab
Original file line number Diff line number Diff line change
@@ -17,6 +17,7 @@ channels:
- nvidia
- pytorch
- conda-forge
- defaults
dependencies:
- dill
- jupyterlab
Original file line number Diff line number Diff line change
@@ -23,13 +23,17 @@ limitations under the License.

Install the packages for training the GNN model.

```
mamba env update -n ${CONDA_DEFAULT_ENV} -f requirements.yml
```bash
export CUDA_VER=11.8
mamba install -n base -c conda-forge conda-merge
conda run -n base --live-stream conda-merge docker/conda/environments/cuda${CUDA_VER}_dev.yml \
models/training-tuning-scripts/fraud-detection-models/requirements.yml > .tmp/merged.yml \
&& mamba env update -n ${CONDA_DEFAULT_ENV} --file .tmp/merged.yml
```

### Options for training and tuning models.

```
```bash
python training.py --help
Usage: training.py [OPTIONS]

Original file line number Diff line number Diff line change
@@ -52,7 +52,6 @@
"%autoreload 2\n",
"import pandas as pd\n",
"import numpy as np\n",
"import matplotlib.pylab as plt\n",
"import os\n",
"import dgl\n",
"import numpy as np\n",
@@ -1011,7 +1010,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.12"
"version": "3.10.13"
}
},
"nbformat": 4,
Original file line number Diff line number Diff line change
@@ -19,12 +19,12 @@ channels:
- dglteam/label/cu118
- pytorch
- conda-forge
- defaults
dependencies:
- click>=8
- cuml=23.06
- dgl
- jupyterlab
- matplotlib
- pytorch-cuda=11.8
- pytorch=2.0.1
- scikit-learn=1.2.2
Original file line number Diff line number Diff line change
@@ -18,6 +18,7 @@ channels:
- nvidia
- pytorch
- conda-forge
- defaults
dependencies:
- cudf=23.06
- jupyterlab
Original file line number Diff line number Diff line change
@@ -18,6 +18,7 @@ channels:
- nvidia
- pytorch
- conda-forge
- defaults
dependencies:
- cudf=23.06
- jupyterlab
Original file line number Diff line number Diff line change
@@ -15,6 +15,7 @@

channels:
- conda-forge
- defaults
dependencies:
- jupyterlab
- matplotlib
Original file line number Diff line number Diff line change
@@ -18,6 +18,7 @@ channels:
- nvidia
- pytorch
- conda-forge
- defaults
dependencies:
- cudf=23.06
- jupyterlab
Original file line number Diff line number Diff line change
@@ -18,6 +18,7 @@ channels:
- nvidia
- pytorch
- conda-forge
- defaults
dependencies:
- cudf=23.06
- jupyterlab
Original file line number Diff line number Diff line change
@@ -17,6 +17,7 @@ channels:
- rapidsai
- nvidia
- conda-forge
- defaults
dependencies:
- click==8.1.3
- cuml=23.06
2 changes: 1 addition & 1 deletion morpheus/llm/nodes/extracter_node.py
Original file line number Diff line number Diff line change
@@ -30,7 +30,7 @@ class ExtracterNode(LLMNodeBase):
"""

def get_input_names(self) -> list[str]:
# This node does not receive it's inputs from upstream nodes, but rather from the task itself
# This node does not receive its inputs from upstream nodes, but rather from the task itself
return []

async def execute(self, context: LLMContext) -> LLMContext:
5 changes: 4 additions & 1 deletion morpheus/llm/services/nemo_llm_service.py
Original file line number Diff line number Diff line change
@@ -25,7 +25,10 @@

IMPORT_ERROR_MESSAGE = (
"NemoLLM not found. Install it and other additional dependencies by running the following command:\n"
"`mamba env update -n ${CONDA_DEFAULT_ENV} --file docker/conda/environments/cuda11.8_examples.yml`")
"`mamba install -n base -c conda-forge conda-merge`\n"
"`conda run -n base --live-stream conda-merge docker/conda/environments/cuda${CUDA_VER}_dev.yml "
" docker/conda/environments/cuda${CUDA_VER}_examples.yml"
" > .tmp/merged.yml && mamba env update -n morpheus --file .tmp/merged.yml`")

try:
import nemollm
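
The hunk above pairs an install hint with a guarded import. A generic sketch of that optional-dependency pattern follows; it is not the Morpheus implementation (the rest of the file is not shown here), and `require_module`, the `HINT` text, and the module name `not_a_real_nemollm` are hypothetical.

```python
import importlib

# Hypothetical hint text; a real message would list the install commands.
HINT = "Optional package not found. Install the example dependencies and retry."


def require_module(name: str, hint: str = HINT):
    """Import `name`, re-raising ImportError with an actionable hint on failure."""
    try:
        return importlib.import_module(name)
    except ImportError as exc:
        raise ImportError(f"{name}: {hint}") from exc


json_mod = require_module("json")  # standard-library module, import succeeds

try:
    require_module("not_a_real_nemollm")  # hypothetical missing package
except ImportError as err:
    print(err)
```

Raising with the hint only when the module is actually requested keeps the rest of the package importable for users who never touch the optional service.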