Summary

  • How to use each method by situation
  • How to use Runnable

LCEL (LangChain Expression Language) is an interface that ties prompt construction, model instantiation, and output generation together into a ==Chain==, letting you build complex workflows easily and intuitively.

특수문자(|)λ₯Ό ν™œμš©ν•˜μ—¬ 본인만의 Chain을 ꡬ좕할 수 μžˆλ‹€.

1️⃣ Methods

| Sync / Async | Description |
| --- | --- |
| invoke / ainvoke | Returns the result for a single input. |
| batch / abatch | Takes a list of inputs and processes them together. |
| stream / astream | Emits the output one chunk at a time. |
| astream_log | Streams the intermediate steps. |

Synchronous vs. Asynchronous

| | Synchronous | Asynchronous |
| --- | --- | --- |
| Description | Runs tasks sequentially | Runs multiple tasks concurrently |
| Pros | Code is simple and easy to understand; easy to debug | Efficient; more responsive |
| Cons | Inefficient use of resources; one failing task can stall the whole program | Harder to debug |
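To make the difference concrete before the LangChain examples, here is a minimal pure-Python sketch (an illustration added here, not LangChain API): three simulated one-second tasks take about three seconds sequentially, but about one second when gathered concurrently.

import time
import asyncio

async def fake_task(name: str) -> str:
    await asyncio.sleep(1)  # simulate one second of I/O wait
    return f"{name} done"

async def main():
    start = time.time()
    # Run three tasks concurrently; total wait is ~1s, not ~3s
    results = await asyncio.gather(*(fake_task(n) for n in ["a", "b", "c"]))
    print(results, f"{time.time() - start:.2f}s")

await main()  # top-level await in Jupyter; use asyncio.run(main()) in a script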
# Base code
from langchain_openai import ChatOpenAI
from langchain_core.prompts import PromptTemplate
from langchain_core.output_parsers import StrOutputParser

prompt = PromptTemplate.from_template("{input}에 λŒ€ν•΄ ν•œκ΅­μ–΄λ‘œ ν•œ μ€„λ‘œ μ„€λͺ…ν•΄μ€˜")
model = ChatOpenAI(model_name="gpt-3.5-turbo")
output_parser = StrOutputParser()
chain = prompt | model | output_parser

πŸ“‹ invoke / ainvoke

import time
import asyncio

# Sample inputs (assumed here; the original does not define input_list)
input_list = [{"input": "파이썬"}, {"input": "μžλ°”"}, {"input": "C++"}]

def run_sync(input_list):
    """Run the chain sequentially with invoke."""
    start_time = time.time()
    for item in input_list:
        result = chain.invoke(item)
        print(result)
    end_time = time.time()
    print("=" * 100)
    print(f"Sync execution time: {end_time - start_time:.2f} seconds")

async def run_async(input_list):
    """Run the chain concurrently with ainvoke."""
    start_time = time.time()
    tasks = [chain.ainvoke(item) for item in input_list]
    results = await asyncio.gather(*tasks)
    end_time = time.time()
    print(f"Async execution time: {end_time - start_time:.2f} seconds")
    print("=" * 100)

    for result in results:
        print(result)

run_sync(input_list)
await run_async(input_list)
# Sync execution time: 7.13 seconds
# Async execution time: 1.58 seconds
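The top-level await above relies on Jupyter's already-running event loop. In a plain Python script, wrap the coroutine with the standard-library asyncio.run instead:

# In a plain .py script there is no running event loop, so use asyncio.run:
if __name__ == "__main__":
    run_sync(input_list)
    asyncio.run(run_async(input_list))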

πŸ“‹ batch / abatch

The speed difference between batch and abatch looks small here, but it becomes more noticeable in more complex pipelines.

import time
import asyncio

def run_sync(input_list):
    """Run the chain with batch."""
    start_time = time.time()
    result = chain.batch(input_list)
    end_time = time.time()
    print(f"Sync execution time: {end_time - start_time:.2f} seconds")
    print("=" * 100)
    print("\n".join(result))

async def run_async(input_list):
    """Run the chain with abatch."""
    start_time = time.time()
    result = await chain.abatch(input_list)
    end_time = time.time()
    print(f"Async execution time: {end_time - start_time:.2f} seconds")
    print("=" * 100)
    print("\n".join(result))

run_sync(input_list)
await run_async(input_list)
# Sync execution time: 1.78 seconds
# Async execution time: 1.65 seconds
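Both batch and abatch also accept a RunnableConfig; for example, max_concurrency caps how many inputs are processed at once. A small sketch (the value 2 is chosen arbitrarily):

# Limit concurrent requests, e.g. to respect API rate limits
result = chain.batch(input_list, config={"max_concurrency": 2})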

πŸ“‹ stream / astream

The return value is a generator, so printing it inside a for loop streams the response chunk by chunk.

# Confirm that the return value is a generator
chain.stream({"input": "파이썬"})

# Output
# <generator object RunnableSequence.stream at 0x0000014E37FB6650>
# stream
for chunk in chain.stream({"input": "파이썬"}):
    print(chunk, end="", flush=True)

# astream returns an async generator, so iterate with async for
async for chunk in chain.astream({"input": "파이썬"}):
    print(chunk, end="", flush=True)

πŸ“‹ astream_log

astream_log streams the intermediate steps of a chain run, which makes it useful for debugging.

stream = chain.astream_log({"input":"파이썬"})
async for chunk in stream:
    print(chunk)
    print("="*100)
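Each yielded chunk is a RunLogPatch, whose .ops attribute holds JSONPatch-style operations describing how the run log changed at that step. A hedged sketch of inspecting them (field names follow langchain_core's RunLogPatch):

# Print only the operation type and path of each patch
async for chunk in chain.astream_log({"input": "파이썬"}):
    for op in chunk.ops:
        print(op["op"], op["path"])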

πŸ“‹ callbacks

Streaming can also be enabled through the callbacks option on ChatOpenAI.

There is a BaseCallbackHandler class; by subclassing it you can handle a wider range of situations than .stream() alone covers.

ν΄λž˜μŠ€μ™€ κ΄€λ ¨λœ λ‚΄μš©μ€ Langchain κ³΅μ‹λ¬Έμ„œμ—μ„œ μ°Έκ³ ν•˜μž.

# Base code
from typing import Any
from langchain_openai import ChatOpenAI 
from langchain_core.prompts import PromptTemplate
from langchain_core.output_parsers import StrOutputParser
from langchain_core.callbacks import BaseCallbackHandler
 
class CustomHandler(BaseCallbackHandler):
    def on_llm_new_token(self, token: str, **kwargs: Any) -> Any:
        """Run on new LLM token. Only available when streaming is enabled."""
        print(token, end="", flush=True)
 
prompt = PromptTemplate.from_template("{input}에 λŒ€ν•΄ ν•œκ΅­μ–΄λ‘œ ν•œ μ€„λ‘œ μ„€λͺ…ν•΄μ€˜")
output_parser = StrOutputParser()

Method 1

model = ChatOpenAI(
    model_name="gpt-3.5-turbo",
    streaming=True,
    callbacks=[CustomHandler()]
)
chain = prompt | model | output_parser
 
response = chain.invoke({"input": "파이썬"})

Method 2

model = ChatOpenAI(
    model_name="gpt-3.5-turbo",
    streaming=True
)
chain = prompt | model | output_parser
 
model.callbacks = [CustomHandler()]
response = chain.invoke({"input": "파이썬"})

Method 3

model = ChatOpenAI(
    model_name="gpt-3.5-turbo",
    streaming=True
)
chain = prompt | model | output_parser
 
response = chain.invoke(
    {"input": "파이썬"},
    config={"callbacks": [CustomHandler()]}
)
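As noted above, a custom handler can go beyond token streaming. A minimal sketch: the hook names are real BaseCallbackHandler methods, but this particular handler and its logging behavior are an assumed illustration.

class VerboseHandler(BaseCallbackHandler):
    """Hypothetical handler that also marks the start and end of each LLM call."""

    def on_llm_start(self, serialized, prompts, **kwargs):
        print(">>> LLM call started")

    def on_llm_new_token(self, token, **kwargs):
        print(token, end="", flush=True)

    def on_llm_end(self, response, **kwargs):
        print("\n>>> LLM call finished")

response = chain.invoke(
    {"input": "파이썬"},
    config={"callbacks": [VerboseHandler()]}
)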

2️⃣ Runnable

Runnables are tools that let you flexibly customize inputs whenever they need to be transformed.

| Function | Description |
| --- | --- |
| RunnablePassthrough() | Passes the input through unchanged. |
| RunnablePassthrough.assign() | Transforms the input or creates new keys. |
| RunnableLambda() | Applies a function to the input to produce new values. |
| RunnableParallel() | Runs chains that share the same input in parallel. |
# Base code
from langchain_openai import ChatOpenAI
from langchain_core.prompts import PromptTemplate
from langchain_core.output_parsers import StrOutputParser
from langchain_core.runnables import RunnablePassthrough, RunnableParallel

prompt = PromptTemplate.from_template("'{input}'을 μ˜μ–΄λ‘œ λ²ˆμ—­ν•΄μ£Όμ„Έμš”")
model = ChatOpenAI(model_name="gpt-3.5-turbo")
output_parser = StrOutputParser()

πŸ“‹ RunnablePassthrough()

Passes the input through unchanged.

λ”°λΌμ„œ μ›λž˜λŠ” λ”•μ…”λ„ˆλ¦¬λ‘œ μž…λ ₯ν•΄μ•Ό ν–ˆμ§€λ§Œ, λ¬Έμžμ—΄λ‘œ μž…λ ₯ λ°›μ•„ runnableμ—μ„œ λ”•μ…”λ„ˆλ¦¬λ₯Ό λ§Œλ“  ν›„ ν”„λ‘¬ν”„νŠΈμ— 전달할 수 μžˆλ‹€.

runnable = {"input": RunnablePassthrough()}
chain = runnable | prompt | model | output_parser 
chain.invoke("λ‹€λžŒμ₯")
 
# Output
# 'Squirrel'

πŸ“‹ RunnablePassthrough.assign()

It can transform the input or create new input keys.

add_runnable = RunnablePassthrough.assign(input=lambda x: x["input"] + "λ₯Ό λ³΄μ•˜μŠ΅λ‹ˆλ‹€")
add_runnable.invoke({"input": "λ‹€λžŒμ₯"})
 
# Output
# {'input': 'λ‹€λžŒμ₯λ₯Ό λ³΄μ•˜μŠ΅λ‹ˆλ‹€'}
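Building on this, .assign() can sit at the front of a chain. A small sketch (this composition is an added illustration using the base-code prompt above): the prompt then receives the augmented dictionary.

# The prompt now sees {'input': 'λ‹€λžŒμ₯λ₯Ό λ³΄μ•˜μŠ΅λ‹ˆλ‹€'} instead of the raw input
chain = add_runnable | prompt | model | output_parser
chain.invoke({"input": "λ‹€λžŒμ₯"})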

πŸ“‹ RunnableLambda()

You can define a function to transform the input.

from langchain_core.runnables import RunnableLambda
 
def add_text(input):
    return "μ„Έμƒμ—μ„œ κ°€μž₯ μž‘μ€ " + input
 
runnable = {"input": RunnableLambda(add_text)}
chain = runnable | prompt | model | output_parser 
chain.invoke("λ‹€λžŒμ₯")
 
# Output
# 'The smallest squirrel in the world'

πŸ“‹ RunnableParallel()

It manages multiple chains that take the same input, running them in parallel.

prompt1 = PromptTemplate.from_template("{country}의 μ£Όμš” μ–Έμ–΄λ₯Ό μ•Œλ €μ€˜")
prompt2 = PromptTemplate.from_template("{country}의 λŒ€ν‘œμ μΈ λžœλ“œλ§ˆν¬ 3개λ₯Ό μ•Œλ €μ€˜")
 
chain1 = prompt1 | model | output_parser
chain2 = prompt2 | model | output_parser
 
combined = RunnableParallel(
    language=chain1,
    landmarks=chain2
)
 
combined.invoke({"country":"ν•œκ΅­"})
 
# Output
# {'language': 'ν•œκ΅­μ˜ μ£Όμš” μ–Έμ–΄λŠ” ν•œκ΅­μ–΄μž…λ‹ˆλ‹€. ν•œκ΅­μ–΄λŠ” λŒ€λΆ€λΆ„μ˜ ν•œκ΅­ μ‚¬λžŒλ“€μ΄ μ‚¬μš©ν•˜λŠ” μ–Έμ–΄λ‘œ, κ΅­λ‚΄μ—μ„œλŠ” 곡식 μ–Έμ–΄λ‘œ μ‚¬μš©λ˜κ³  μžˆμŠ΅λ‹ˆλ‹€. λ˜ν•œ, μ˜μ–΄λ„ λ§Žμ€ μ‚¬λžŒλ“€μ΄ ν•™μŠ΅ν•˜κ³  μ‚¬μš©ν•˜κ³  있으며, 쀑ꡭ어와 일본어도 일뢀 μ§€μ—­μ—μ„œ μ‚¬μš©λ˜κ³  μžˆμŠ΅λ‹ˆλ‹€.',
#  'landmarks': '1. λ‚¨μ‚°νƒ€μ›Œ - μ„œμšΈμ˜ λŒ€ν‘œμ μΈ λžœλ“œλ§ˆν¬λ‘œμ„œ, μ„œμšΈ μ‹œλ‚΄μ™€ ν•œκ°•μ„ ν•œλˆˆμ— λ³Ό 수 μžˆλŠ” μ „λ§λŒ€κ°€ 유λͺ…ν•˜λ‹€.\n2. 경볡ꢁ - μ„œμšΈμ— μœ„μΉ˜ν•œ μ‘°μ„  μ‹œλŒ€μ˜ κΆκΆλ‘œμ„œ, μ•„λ¦„λ‹€μš΄ 전톡 ν•œμ˜₯ 건물과 κ·Όμ •μ „, 경회루 등을 λ³Ό 수 μžˆλ‹€.\n3. λΆ€μ‚° νƒ€μ›Œ - λΆ€μ‚°μ˜ λžœλ“œλ§ˆν¬λ‘œμ„œ, λΆ€μ‚° μ‹œλ‚΄μ™€ ν•΄μ•ˆλ„λ‘œλ₯Ό ν•œλˆˆμ— λ³Ό 수 μžˆλŠ” μ „λ§λŒ€μ™€ 야경이 유λͺ…ν•˜λ‹€.'}
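Since RunnableParallel is itself a Runnable, it supports the same methods as any chain. A hedged sketch using batch (the extra country is chosen arbitrarily):

# RunnableParallel is a Runnable, so batch / stream / ainvoke also work
combined.batch([{"country": "ν•œκ΅­"}, {"country": "프랑슀"}])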

πŸ“‹ Using itemgetter

itemgetter extracts the value of a specific key from a dictionary.

from operator import itemgetter
 
prompt = PromptTemplate.from_template("'{input}'을 {language}둜 λ²ˆμ—­ν•΄μ£Όμ„Έμš”")
runnable = {
    "input": itemgetter("input") | RunnableLambda(lambda x: x + "λŠ” λ§›μžˆμ–΄"),
    "language": itemgetter("language")
}
chain = runnable | prompt | model | output_parser
chain.invoke({"input": "μ˜€λ Œμ§€", "language": "μ˜μ–΄"})
 
# Output
# '"Orange is delicious."'
