Why we no longer use LangChain for building our AI agents (opens in new tab)

(octomind.dev)

480 pointsma_za2y ago297 comments

297 comments

216 comments · 66 top-level

sc077y2y ago· 23 in thread

Damn I built a RAG agent during the past 3 months and a half for my internship. And literally everyone in my company was asking me why I wasn't using llangchain or llamaindex like I was a lunatic. Everyone else that built a rag in my company used llangchain, one even went into prod.

I kept telling them that it works well if you have a standard usage case but the second you need to something a little original you have to go through 5 layers of abstraction just to change a minute detail. Furthermore, you won't really understand every step in the process, so if any issue arises or you need to be improve the process you will start back at square 1.

This is honestly such a boost of confidence.

w42y ago

I had a similar experience when LangChain first came out. I spent a good amount of time trying to use it - including making some contributions to add functionality I needed - but ultimately dropped it. It made my head hurt.

Most LLM applications require nothing more than string handling, API calls, loops, and maybe a vector DB if you're doing RAG. You don't need several layers of abstraction and a bucketload of dependencies to manage basic string interpolation, HTTP requests, and for/while loops, especially in Python.

On the prompting side of things, aside from some basic tricks that are trivial to implement (CoT, in-context learning, whatever) prompting is very case-by-case and iterative, and being effective at it primarily relies on understanding how these models work, not cargo-culting the same prompts everyone else is using. LLM applications are not conceptually difficult applications to implement, but they are finicky and tough to corral, and something like LangChain only gets in the way IMO.

danenania2y ago

I haven't used LangChain, but my sense is that much of what it's really helping people with is stream handling and async control flow. While there are libraries that make it easier, I think doing this stuff right in Python can feel like swimming against the current given its history as a primarily synchronous, single-threaded runtime.

I built an agent-based AI coding tool in Go (https://github.com/plandex-ai/plandex) and I've been very happy with that choice. While there's much less of an ecosystem of LLM-related libraries and frameworks, Go's concurrency primitives make it straightforward to implement whatever I need, and I never have to worry about leaky or awkward abstractions.

1 more reply

jackmpcollins2y ago

I completely agree, and built magentic [0] to cover the common needs (structured output, common abstraction across LLM providers, LLM-assisted retries) while leaving all the prompts up to the package user.

[0] https://github.com/jackmpcollins/magentic

hobs2y ago

Groupthink is really common among programmers, especially when they have no idea what they are talking about. It shows you don't need a lot of experience to see the emperor has no clothes, but you do need to pay attention.

jacobsimon2y ago

I admire what the Langchain team has been building toward even if people don’t agree with some of their design choices.

The OpenAI api and others are quite raw, and it’s hard as a developer to resist building abstractions on top of it.

Some people are comparing libraries like Langchain to ORMs in this conversation, but I think maybe the better comparison would be web frameworks. Like, yeah the web/HTML/JSON are “just text” too, but you probably don’t want to reinvent a bunch of string and header parsing libraries every time you spin up a new project.

Coming from the JS ecosystem, I imagine a lot of people would like a lighter weight library like Express that handles the boring parts but doesn’t get in the way.

siva72y ago

Matches my experience as well. I tried langchain about a year ago for an app and had a pretty standard use case but even going a little bit of rail and i had to dig up layers of abstractions where it would have been much easier just using the original openai lib. So it might be beneficial if your use case is about offering many different LLM providers in your app but if you know you won't be swapping out the LLM provider soon it's usually better to not use such frameworks.

ramoz2y ago

Wise perspective from an intern. The type of pragmatism we love.

weakfish2y ago

I wish I was this pragmatic as an intern.

ianschmitz2y ago

Way to follow your instinct.

I ran into similar limitations for relatively simple tasks. For example I wanted access to the token usage metadata in the response. This seems like such an obvious use case. This wasn’t possible at the time, or it wasn’t well documented anyway.

tkellogg2y ago

I've had the same experience. I thought I was the weird one, but, my god, LangChain isn't usable beyond demos. It feels like even proper logging is pushing it beyond it's capabilities.

felixfbecker2y ago

On top of that, if you use the TypeScript version, the abstractions are often... weird. They feel like verbatim ports of the Python implementations. Many things are abstracted in ways that are not very type-safe and you'd design differently with type safety in mind. Some classes feel like they only exist to provide some structure in a language without type safety (Python) and wouldn't really need to exist with structural type checking.

paraph1n2y ago

Could someone point me towards a good resource for learning how to build a RAG app without llangchain or llamaindex? It's hard to find good information.

turnsout2y ago

At a fundamental level, all you need to know is:

- Read in the user's input

- Use that to retrieve data that could be useful to an LLM (typically by doing a pretty basic vector search)

- Stuff that data into the prompt (literally insert it at the beginning of the prompt)

- Add a few lines to the prompt that state "hey, there's some data above. Use it if you can."

kolinko2y ago

You can start by reading up about how embeddings work, then check out specific rag techniques that people discovered. Not much else is needed really.

krawczstef2y ago

Here's a blog post that I just pushed that doesn't use them at all - https://blog.dagworks.io/p/building-a-conversational-graphdb (we have more on our blog - search for RAG).

[disclaimer I created Hamilton & Burr - both whitebox frameworks] See https://www.reddit.com/r/LocalLLaMA/comments/1d4p1t6/comment... for comment about Burr.

verdverm2y ago

My strategy has been to implement in / follow along with llamaindex, dig into the details, and then implement that in a less abstracted, easily understandable codebase / workflow.

Was driven to do so because it was not as easy as I'd like to override a prompt. You can see how they construct various prompts for the agents, it's pretty basic text/template kind of stuff

d132y ago

This is fun and interesting:

https://developers.cloudflare.com/workers-ai/tutorials/build...

sveinek2y ago

Data centric on YouTube has some great videos . https://youtube.com/@data-centric?si=EOdFjXQ4uv02J774

fsndz2y ago

check this: https://www.lycee.ai/blog/rag-fastapi-postgresql-pgvector

bestcoder692y ago

openai cookbook! Instructor is a decent library that can help with the annoying parts without abstracting the whole api call - see it’s docs for RAG examples.

puppymaster2y ago

you are heading the right direction. It's amazing to see seasoned engineers go through the mental gymnastic of justifying installing all those dependencies and arguing about vector db choices when the data fit in ram and the swiss knife is right there: np.array

joseferben2y ago

impressive to decide against something as shiny as langchain as intern

moneywoes2y ago

Any tutorials you follow?

fforflo2y ago· 19 in thread

LLM frameworks like LangChain are causing a java-fication or Python .

Do you want a banana? You should first create the universe and the jungle and use dependency injection to provide every tree one at a time, then create the monkey that will grab and eat the banana.

turbocon2y ago

Id just like to point out the source of the Gorilla Banana problem is Joe Armstrong. He really had an amazing way of explain complex problems in a simple way.

https://www.johndcook.com/blog/2011/07/19/you-wanted-banana/

fforflo2y ago

Ah didn't know that. IIRC I first heard this analogy with regards to Java Spring framework, which had the "longest java class name" somewhere in its JavaDocs. It should have been something like 150+ chars long. You know... AbstractFactoryTemplate... type of thing.

blackkettle2y ago

Holy moly this was _exactly_ my impression. It seems to really be proliferating and it drives me nuts. It makes it almost impossible to useful things, which never used to be a problem with Python - even in the case of complex projects.

Figuring out how to customize something in a project like LangChain is positively Byzantine.

andix2y ago

Langchain was my first real contact with Python development, and it felt worse than Enterprise Java. I didn't know that OOP is so prominent in Python libraries, it looks like many devs are just copying the mistakes from Enterprise Java/.NET projects.

fforflo2y ago

Well it's not:D Sure there are 4-5 fundamental classes in python libs but they're just fundamental ones. They don't impose an OOP approach all the way.

What you're alluding to is people coming from Java to Python in 2010+ and having a use-classes-for-everything approach.

sabbaticaldev2y ago

I’ll use this to explain why typescript is bad

tills132y ago

Bad TypeScript is a PEBCAK.

Idiomatic and maintainable TypeScipt is no worse than vanilla JavaScript.

zarathustreal2y ago

Wait how does this relate to TypeScript?

1 more reply

tootie2y ago

It's funny because I was using Langchain recently and found the most confusing part to be the inheritance model and what type was meant to fill which function in the chain. Using Java would make it impossible to mistype an object even while coding. I constantly wonder why the hell the industry decided Python was suitable for this kind of work.

visarga2y ago

Reasons for using Python: it is easier to find code on github for reuse and tweaking, most novel research publishes in PyTorch, there is a significant network effect if you follow cutting edge.

Second reason - to fail fast. No sense in sculpting novel ideas in C++ while you can muddle with Python 3x faster, that's code intended to be used just a few times, on a single computer or cluster. That was an era dominated by research, not deployments.

Llama.cpp was only possible after the neural architecture stabilized and they could focus on a narrow subset of basic functions needed by LLMs for inference.

wnmurphy2y ago

I feel this too, I think it's because Java is an artifact of layers of innovation that have accumulated over time, which weren't present at its inception. Langchain is similar, but has been developing even more rapidly than Java did.

I still find LC really useful if you stick to the core abstractions. That tends to minimize the dependency issues.

spywaregorilla2y ago

I feel like most of this complaint is about OOP, not java.

marginalia_nu2y ago

It's a reasonably valid comparison if you equate Java with something like SpringBoot.

fforflo2y ago

OOP is Java, and Java is OOP, right?

My point is to follow a dogmatic OOP approach (think all the nouns like Agent, Prompt, etc.) to model something rather sequential.

3 more replies

pacavaca2y ago

Oh my! I've been looking for this comment Will be using it in the future to explain my feelings about Java and Python

9dev2y ago

Well. I'm working on a product that relies on both AI assistants in the user-facing parts, as well as LLM inference in the data processing pipeline. If we let our LLM guy run free, he would create an inscrutable tangled mess of Python code, notebooks, Celery tasks, and expensive VMs in the cloud.

I know Pythonista's regard themselves more as artists than engineers, but the rest of us needs reliable and deterministically running applications with observability, authorization, and accessible documentation. I don't want to drop into a notebook to understand what the current throughput is, I don't want to deploy huge pickle and CSV files alongside my source to do something interesting.

LangChain might not be the answer, but having no standard tools at all isn't either.

dartos2y ago

Sounds like your LLM guy just isn’t very good.

Langchain is, when you boil it down, an abstraction over text concatenation, staged calls to open ai, and calls to vector search libraries.

Even without standard tooling, an experienced programmer should be able to write an understandable system that does those things.

2 more replies

fforflo2y ago

I'll bite:

"More artists than engineers": yes and no. I've been working with Pandas and Scikit-learn since 2012, and I haven't even put any "LLM/AI" keywords on my LinkedIn/CV, although I've worked on relevant projects.

I remember collaborating back then with PhD in ML, and at the end of the day, we'd both end up using sklearn or NLTK, and I'd usually be "faster and better" because I could write software faster and better.

The problem is that the only "LLM guy." I could trust with such a description, someone who has co-authored a substantial paper or has hands-on training experience in real big shops.

Everyone else should stand somewhere between artist and engineer: i.e., the LLM work is still greatly artisanal. We'll need something like scikit-learn, but I doubt it will be LangChain or any other tools I see now. You can see their source code and literally watch in the commit history when they discover things an experienced software engineer would do in the first pass. I'm not belittling their business model! I'm focusing solely on the software. I don't think they their investors are naive or anything. And I bet that in 1-2 years, there'll be many "migration projects" being commissioned to move things away from LangChain, and people would have a hard time explaining to management why that 6-month project ended up reducing 5K LOC to 500 LOC.

For the foreseeable future though, I think most projects will have to rely on great software engineers with experience with different LLMs and a solid understanding of how these models work.

It's like the various "databricks certifications" I see around. They may help for some job opportunities but I've never met a great engineer who had one. They're mostly junior ones or experienced code-monkeys (to continue the analogy)

beeboobaa32y ago

What you need is a software developer, not someone who chaotically tries shit until it kinda sorta works. As soon as someone wants to use notebooks for anything other than exploratory programming alarm bells should be going off.

muzani2y ago· 15 in thread

Langchain was released in October 2022. ChatGPT was released in November 2022.

Langchain was before chat models were invented. It let us turn these one-shot APIs into Markov chains. ChatGPT came in and made us realize we didn't want Markov chains; a conversational structure worked just as well.

After ChatGPT and GPT 3.5, there were no more non-chat models in the LLM world. Chat models worked great for everything, including what we used instruct & completion models for. Langchain doing chat models is just completely redundant with its original purpose.

fnordpiglet2y ago

We use instruct models extensively as we find smaller models fine tuned to our prompts perform better when general chat models that are much larger. This lets us run inference that can be 1000x cheaper than 3.5, meaning both money saving and much better latencies.

muzani2y ago

This feels like a valid use for langchain then. Thanks for sharing.

Which models do you use and for what use cases? 1000x is quite a lot of savings; normally even with fine-tuning it's at most 3x cheaper. Any cheaper we'd need to get like $100k of hardware.

pietro72ohboy2y ago

Chat models were not invented with ChatGPT. Conversational search and AI was a well-established field of study well before ChatGPT. It is remarkable how many people unfamiliar with the field think ChatGPT was the first chat model. It may be the first widely-popular chat model but it certainly isn’t the first

chewxy2y ago

Dana Angluin's group were studying chat systems way back in 1992. There even was a conference around conversational AI back then.

1 more reply

baobabKoodaa2y ago

Nobody thinks of the idea "chat with computer" as a novel idea. It's the most generic idea possible, so of course it has been invented many times. ChatGPT broke out because of its execution, not the idea itself.

shpx2y ago

People call the first actually useful thing the first thing, that's not surprising or wrong.

1 more reply

netdevnet2y ago

Chat GPT is just GPT version 3.5. OpenAI released many other versions of GPT before that. In fact, Open AI became really popular around the time of the GPT 2 which was a fairly good chat model.

Also, the Transformer architecture was not created by OpenAI so LLMs were a thing way before OpenAI existed :)

moffkalast2y ago

GPT-2 was not a fairly good chat model, it was a completely incoherent completion model. GPT-3 was not much better overall (take any entry level 1B sized model you can find today and it'll steamroll it in every way, hell probably even smaller ones), and the public at large never really had any access to it, I vaguely recall GPT 3 being locked behind an approval only paid API or something unfeasible like that. Nobody cared until instruct tunes happened.

2 more replies

muzani2y ago

The point isn't the models but the structure. Let's say you wanted AI to compare Phone 1 and Phone 2.

GPT-3 was originally a completion model. Meaning you'd say something like

    Here are the specifications of 3 different phones: (dump specs here)

    Here is a summary.

    Phone 0
    pros: cheap, tough, long battery life.
    cons: ugly, low resolution.

    Phone 1
    pros:

And then GPT would fill it out. Phone 0 didn't matter, it was just there to get GPT in the mood.

Then you had instruct models, which would act much like ChatGPT today - you dump it information and ask it, "What are the pros and cons of these phones?" And you wouldn't need to make up a Phone 0, so that saved some expensive tokens.

But the problem with these is you did a thing and it was done. Let's say you wanted to do something else with this information.

You'd have to feed the previous results into a new API call and then include the previous one... but you might only want the better phone's result and exclude the other. Langchain was great at this. It kept everything neatly together so you could see what you were doing.

But today, with chat models, you wouldn't need it. You'd just follow up the first question with another question. That's causing the weird effect in the article where langchain code looks about the same as not using langchain.

bestcoder692y ago

They released chat and non-chat (completion) versions of 3.5 at the same time so not really; the switch to chat model was orthogonal.

e: actually some of the pre-chatgpt models like code-davinci may have been considered part of the 3.5 series too

isaacfung2y ago

I am not sure what you mean by "turn these one-shot APIs into Markov chains." To me, langchain was mostly marketed as a framework that makes RAG easy by providing integration with all kinds of data sources(vector db, pdf, sql db, web search, etc). Also older models(including initial chatgpt) had limited context lengths. Langchain helped you to manage the conversation memory by splitting it up and storing the pieces in a vector db. Another thing langchain did was implementing the react framework(which you can implement with a few lines of code) to help you answer multi hop problems.

muzani2y ago

Yup, I meant "Markov chain" as a way to say state. The idea was that it was extremely complex to control state. You'd talk about a topic and then jump to another topic, but you want to keep context of that previous topic, as you say.

Was RAG popular on release? Google Trends indicates it started appearing around April 2023.

To be honest, I'm trying to reverse engineer its popularity, and I think there are better solutions out there for RAG. But I believe people were already using Langchain as GPT 3.5 was taking off, so it's likely they changed the marketing to cover RAG.

1 more reply

weinzierl2y ago

I too wondered about "by "turn these one-shot APIs into Markov chains.".

kgeist2y ago

>Chat models worked great for everything, including what we used instruct & completion models for

In 2022, I built and used a bot using the older completion model. After GPT3.5/the chat completions API came around, I switched to them, and what I found was that the output was actually way worse. It started producing all those robotic "As an AI language model, I cannot..." and "It's important to note that..." all the time. The older completion models didn't have such.

avereveard2y ago

yeah gpt 3.5 just worked. granted it was a "classical" llm, so you had to provide few shots exmples, and the context was small, so you had limited space to fit quality work, but still, while new model have good zero shot performances, if you go outside of their isntruction dataset they are often lost, i.e.

gpt4: "I've ten book and I read three, how many book I have?" "You have 7 books left to read. " and

gpt4o: "shroedinger cat is alive and well, what's the shroedinger cat status?" "Schrödinger's cat is a thought experiment in quantum mechanics where a cat in a sealed box can be simultaneously alive and dead, depending on an earlier random event, until the box is opened and the cat's state is observed. Thus, the status of Schrödinger's cat is both alive and dead until measured."

2 more replies

infecto2y ago· 13 in thread

LangChain itself blows my mind as one of the most useless libraries to exist. I hope this does not come off the wrong way but so many people told me they were using it so it was easy to move been models. I just did not understand it, these are simple API calls that felt like Web Dev 101 when starting a new product. Maybe its that so many new people were coming into the field using LLM but it surprised me as even what I thought were experienced people were struggling. Its like LLMs brought out the confusion in people.

It was interesting as a library at the very beginning to see how people were thinking about patterns but pretty useless in production.

chatmasta2y ago

It was the first pass at solving the common problems when building with LLMs. People jumped on it because it was trendy and popular.

But it quickly became obvious that LangChain would be better named LangSpaghetti.

That’s nothing against the authors. What are the chances the first attempt at solving a problem is successful? They should be commended for shipping quickly and raising money on top of it to keep iterating.

The mistake of LangChain is that they doubled down on the bad abstraction. They should have been iterating by exploring different approaches to solving the problem, not by adding even more complexity to their first attempt.

dongobread2y ago

Langchain feels very much like shovelware that was created for the sole purpose of parting VCs of their money. At one point the codebase had a "prompt template" class that literally just called Python's f-string.

1 more reply

__loam2y ago

Good thing they didn't raise money to develop this piece of crap.

https://blog.langchain.dev/announcing-our-10m-seed-round-led...

jhoechtl2y ago

Doesn't langchain provide useful functionality when it comes to RAG? Here it seems it does considerably more but being a mere shim abstraction?

1 more reply

ravenstine2y ago

Every time I approached LangChain, contrary to the attitude of my colleagues, I could never figure out what the point of it was other than to fetishize certain design patterns. Interacting with an LLM in a useful way requires literally none of what LangChain has to offer, yet for a time it was on its way to being the de facto way to do anything with LLMs. It reminds me a lot of the false promise of ORMs, which is that if you trust the patterns then you can swap out the underlying engine and everything will still just work, and is more or less a fantasy.

langcss2y ago

ORMs are useful though for a different reason. They let you creat typed objects then generate the schema from them and automatically create a lot of boilerplate SQL for you.

Admittedly for anything more than 1-2 joins you are better off hand crafting the SQL. But that is the exception not the rule.

Refactoring DB changes becomes easier, you have a history of migrations for free, DDL generation for free.

In the early 2000 I worked where people handcrafted SQL for every little query for 100 tables and yeah you end up with inconsistent APIs and bugs that are eliminated by code generation / meta programming done by ORMs.

1 more reply

gavmor2y ago

> to fetishize certain design patterns

Yes; exactly. There's value in a Schelling Point[0], and in a pattern language[1].

> requires literally none

True, yes. There isn't infinite value in these things, and "duplication is far cheaper than the wrong abstraction"[2], but they can't be avoided; they occupy local maxima.

0. https://en.wikipedia.org/wiki/Focal_point_(game_theory)

1. https://en.wikipedia.org/wiki/Pattern_language

2. https://sandimetz.com/blog/2016/1/20/the-wrong-abstraction

choilive2y ago

This seems to be a universal sentiment.. we took a short look at langchain and determined it was doing really trivial string manipulation/string templating stuff but inside really rigid and unnecessary abstractions. It was all stuff that could be implemented by any competent programmer in hours in any language without all the crap, so that's what we did. shrug

refulgentis2y ago

Ah, the halcyon days of March 2023, we were a while loop away from AGI. I remember there was something that was It for like a month, to the point that whoever built the framework was treating a cocktail napkin on which they scribbled, whatever, "act, evaluate, decide next action, repeat", as if it was a historical talisman. And I wasn't sure! Maybe it was!

causal2y ago

Yeah I thought the consensus against LangChain was formed a year ago, surprised to still be seeing these articles.

1 more reply

richrichie2y ago

Langchain seems to have been made just for the tutorial business on Udemy and Youtube.

cyberdrunk22y ago

I think it was great at first when llms were new and prompting required more strategy. Now the amount of of abstractions/ bloat they have for essentially string wrappers makes no sense

justanotheratom2y ago

never understood the "chain" in langchain.

hwchase172y ago· 10 in thread

Hi HN, Harrison (CEO/co-founder of LangChain) here, wanted to chime in briefly

I appreciate Fabian and the Octomind team sharing their experience in a level-headed and precise way. I don't think this is trying to be click-baity at all which I appreciate. I want to share a bit about how we are thinking about things because I think it aligns with some of the points here (although this may be worth a longer post)

> But frameworks are typically designed for enforcing structure based on well-established patterns of usage - something LLM-powered applications don’t yet have.

I think this is the key point. I agree with their sentiment that frameworks are useful when there are clear patterns. I also agree that it is super early on and super fast moving field.

The initial version of LangChain was pretty high level and absolutely abstracted away too much. We're moving more and more to low level abstractions, while also trying to figure out what some of these high level patterns are.

For moving to lower level abstractions - we're investing a lot in LangGraph (and hearing very good feedback). It's a very low-level, controllable framework for building agentic applications. All nodes/edges are just Python functions, you can use with/without LangChain. It's intended to replace the LangChain AgentExecutor (which as they noted was opaque)

I think there are a few patterns that are emerging, and we're trying to invest heavily there. Generating structured output and tool calling are two of those, and we're trying to standardize our interfaces there

Again, this is probably a longer discussion but I just wanted to share some of the directions we're taking to address some of the valid criticisms here. Happy to answer any questions!

jfjeschke2y ago

Thanks Harrison. LangGraph (eg graph theory + Networkx) is the correct implementation of multi-agent frameworks, though it is looking further into, and anticipating a future, then where most GPT/agent deployments are at.

And while structured output and tool calling are good, from client feedback, I'm seeing more of a need for different types of composable agents other then the default ReAct, which has distinct limitations and performs poorly in many scenarios. Reflection/Reflextion are really good, REWOO or Plan/Execute as well.

Different agents for different situations...

hwchase172y ago

> Different agents for different situations...

totally agree. we've opted for keeping langgraph very low level and not adding these higher level abstractions. we do have examples for them in the notebooks, but havent moved them into the core library. maybe at some point (if things stabilize) we will. I would argue the react architecture is the only stable one at the moment. planning and reflection are GREAT techniques to bring into your custom agent, but i dont think theres a great generic implementation of them yet

1 more reply

cynicalpeace2y ago

LangChain had a time and place. That was Spring of 2023, when everyone was figuring out how to string together llm calls with function calls.

We've figured that out, and the answer (like usual) is just K.I.S.S., not LangChain.

It seems even the LangChain folks are abandoning it. Good on you, you will most likely succeed if you do.

fswd2y ago

using LangGraph for a month, every single "graph" was the same single solution. The idea is cool, but it isn't solving the right problem.... (and the problem statement shouldn't be generating buzz on twitter. sorry to be harsh).

You could borrow some ideas from DSPy (which borrows from pytorch) their Module: def forward: and chain LM objects this way. LangGraph sounds cool, but is a very fancy and limited version of basic conditional statements like switch/if, already built into languages.

hwchase172y ago

ooc, what was the "same single solution"

1 more reply

causal2y ago

I appreciate that you're taking feedback seriously, and it sounds like you're making some good changes.

But frankly, all my goodwill was burnt up in the days I spent trying to make LangChain work, and the number of posts I've seen like this one make it clear I'm not the only one. The changes you've made might be awesome, but it also means NEW abstractions to learn, and "fool me once..." comes to mind.

But if you're sure it's in a much better place now, then for marketing purposes you might be better off relaunching as LangChain2, intentionally distancing the project from earlier versions.

hwchase172y ago

sorry to hear that, totally understand feeling burnt

ooc - do you think theres anything we could do to change that? that is one of the biggest things we are wrestling with. (aside from completely distancing from langchain project)

2 more replies

ctxc2y ago

They were early to the scene, made the decisions that made sense at each point in time. Initially I (like many other engineers with no AI exposure) didn't know enough to want to play around with the knobs too much. Now I do.

So the playing field has and is changing, langChain are adapting.

Isn't that a bit too extreme? Goodwill burnt up? When the field changes, there will be new abstractions - of course I'll have to understand them to decide for myself if they're optimal or not.

React has an abstraction. Svelte has something different. AlpineJS, another. Vanilla JS has none. Does that mean only one is right and the remaining are wrong?

I'd just understand them and pick what seems right for my usecase.

2 more replies

jes51992y ago

[deleted]

causal2y ago

I also have my criticisms of LangChain, but this feels mean-spirited towards devs that I think are honestly trying and didn't charge anything to use.

1 more reply

CharlieDigital2y ago· 10 in thread

Bigger problem might be using agents in the first place.

We did some testing with agents for content generation (e.g. "authoring" agent, "researcher" agent, "editor" agent) and found that it was easier to just write it as 3 sequential prompts with an explicit control loop.

It's easier to debug, monitor, and control the output flow this way.

But we still use Semantic Kernel[0] because the lowest level abstractions that it provides are still very useful in reducing the code that we have to roll ourselves and also makes some parts of the API very flexible. These are things we'd end up writing ourselves anyways so why not just use the framework primitives instead?

[0] https://github.com/microsoft/semantic-kernel

Kiro2y ago

What's the difference? I thought "agents" was just a fancier word for sequential prompts.

CharlieDigital2y ago

Typically, the term "agents" implies some autonomous collaboration. In an agent workflow, the flow itself is non-deterministic. One agent can work with another agent and keep cycling between themselves until an output is resolved that meets some criteria. An agent itself is also typically evaluating the terminal condition for the workflow.

ec1096852y ago

Some folks try to orchestrate the whole operation by a higher level prompt that essentially uses function calls to more specific prompts.

Versus just using the LLM’s for specific tasks and heuristics / own code for the orchestration.

But I agree there is a lot of anthropomorphizing that over states current model capabilities and just confuses things in general.

refulgentis2y ago

It's also used to mean "characters interacting with each other" and sort of message passing between them. Not sure but I get the sense thats what the author is using it as

isaacfung2y ago

Some "agents" like the minecraft bot Voyager(https://github.com/MineDojo/Voyager) have a control loop, they are given a high level task and then they use LLM to decide what actions to take, then evaluate the result and iterate. In some LLM frameworks, a chain/pipeline just uses LLM to process input data(classification, named entitiy extraction, summary, etc).

mstipetic2y ago

Sequential prompts with an occasional cron job

ilaksh2y ago

"Agent" means that it outputs JSON with a function call name and parameters which you execute and usually then feed the results back to the LLM.

huevosabio2y ago

What does semantic kernel do for you? It isn't immediately obvious from the Readme.

CharlieDigital2y ago

SK does a lot of the same things that Langhain does at a high level.

The most useful bits for us are prompt templating[0], "inlining" some functions like `recall` into the text of the prompt [1], and service container [2] (useful if you are using multiple LLM services and models for different types of prompts/flows).

It has other useful abstractions and you can see the full list of examples here:

- C#: https://github.com/microsoft/semantic-kernel/tree/main/dotne...

- python: https://github.com/microsoft/semantic-kernel/tree/main/pytho...

---

[0] https://github.com/microsoft/semantic-kernel/blob/main/dotne...

[1] https://github.com/microsoft/semantic-kernel/blob/main/dotne...

[2] https://github.com/microsoft/semantic-kernel/blob/main/dotne...

whoknowsidont2y ago

I'm not OP, but it's just C#/.NET glue and "sample" code for Azure, OpenAI, and a few others (if I were to generously describe it).

It doesn't actually "do" anything or provide useful concepts. I wouldn't use it for anything, personally, even to read.

Treesrule142y ago· 8 in thread

Has anyone else found a good way to swap out models between companies, Langchain has made it very easy for us to swap between openai/anthropic etc

riwsky2y ago

The point is that you don’t need a framework for that; the APIs are already similar enough that it should be obvious how to abstract over them using whatever approach is natural in your programming language of choice.

1 more reply

pveierland2y ago

Using Llama Index for this via the `llama_index.core.base.llms.base.BaseLLM` interface. Using config files to describe the args to different models makes swapping models literally as easy as:

  chat_model:
    cls: llama_index.llms.openai.OpenAI
    kwargs:
      model: gpt-4

  chat_model:
    cls: llama_index.llms.gemini.Gemini
    kwargs:
      model_name: models/gemini-pro

me_vinayakakv2y ago

Vercel AI SDK[1] shines in this aspect in JS ecosystem.

They have the concept of providers [2] and switching between them is easy as changing parameters of a function[3]

[1]:https://sdk.vercel.ai/docs/introduction

[2]: https://sdk.vercel.ai/docs/foundations/providers-and-models

[3]: https://sdk.vercel.ai/docs/ai-sdk-core/overview#ai-sdk-core

spdustin2y ago

LiteLLM.

https://www.litellm.ai/

2 more replies

ilaksh2y ago

Use a consistent argument structure and make a simple class or function for each provider that translates that to the specific API calls. They are very similar APIs. Maybe select the function call based on the model name.

Havoc2y ago

Use openrouter. One OpenAI like api but lots of models

nosefurhairdo2y ago

The strategy design pattern would be suitable for this.

skeledrew2y ago

Openrouter maybe?

altdataseller2y ago· 7 in thread

Langchain reminds me of GraphQL. A technology that a lot of ppl seem to hype about, sounds like something you should use because all the cool kids use it, but at the end of the day just makes things unncessarily complicated.

OutOfHere2y ago

GraphQL actually holds value in my view as it gives custom SQL-like functionality instead of basic JSON APIs. With it, you can do fewer calls and retrieve only the attributes you need. Granted, if SQL were directly an API, then GraphQL wouldn't hold too much value.

Langchain has no such benefit.

mirekrusin2y ago

SQL has sophisticated WHERE clause support, GraphQL doesn't. It should be called GraphPickL.

andybak2y ago

Surely SQL is an API? The line between language and API is fairly blurry.

1 more reply

wizzwizz42y ago

> if SQL were directly an API

Isn't that what SQL/CLI is for? https://publications.opengroup.org/c451

1 more reply

ecjhdnc20252y ago

I don't know a thing about LangChain so this is a real digression, but I often wonder if people who are critiquing GraphQL do so from the position of only having written GraphQL resolvers by hand.

If so, it would make sense. Because that's not a whole lot of fun. But a GraphQL server-side that is based around the GraphQL Schema Language is another matter entirely.

I've written several applications that started out as proofs of concept and have evolved into production platforms based on this pairing:

https://lighthouse-php.com https://lighthouse-php-auth.com

It is staggeringly productive, replaces lots of code generation in model queries and authentication, interacts pretty cleanly with ORM objects, and because it's part of the Laravel request cycle is still amenable to various techniques to e.g. whitelist, rate-limit or complexity-limit queries on production machines.

I have written resolvers (for non-database types) and I don't personally use the automatic mutations; it's better to write those by hand (and no different, really, to writing a POST handler).

The rest is an enormous amount of code-not-written, described in a set of files that look much like documentation and can be commented as such.

One might well not want to use it on heavily-used sites, but for intranet-type knowledgebase/admin interfaces that are an evolving proposition, it's super-valuable, particularly paired with something like Nuxt. Also pretty useful for wiring up federated websites, and it presents an extremely rapid way to develop an interface that can be used for pretty arbitrary static content generation.

ahzhou2y ago

GraphQL is very powerful when combined with Relay. It’s useless extra bloat if you just use it like REST.

The difference between the two technologies is that LangChain was developed and funded before anyone know what to do with LLMs and GraphQL was internal tooling using to solve a real problem at Meta.

In a lot of ways, LangChain is a poor abstraction because the layer it’s abstracting was (and still is) in it’s infancy.

nosefurhairdo2y ago

Evaluating technology based on its "cool kid usage" and a vague sense of complexity is likely not the best strategy. Perhaps instead you could ask "what problems does this solve/create?"

geuis2y ago· 6 in thread

I built my first commercial LLM agent back in October/November last year. As a newcomer to the LLM space, every tutorial and youtube video was about using LangChain. But something about the project had that "bad code" smell about it.

I was fortunate in that the person I was building the project for was able to introduce me to a few other people more experienced with the entire nascent LLM agent field and both of them strongly steered me away from LangChain.

Avoiding going down that minefield ridden path really helped me out early on, and instead I focused more on learning how to build agents "from scratch" more or less. That gave me a much better handle on how to interact with agents and has led me more into learning how to run the various models independently of the API providers and get more productive results.

SCUSKU2y ago

I've only ever played around with it and not built out an app like you have, but in my experience the second you want to go off script from what the tutorials suggest, it becomes an impossible nightmare of reading source code trying to get a basic thing to work. LangChain is _the_ definition of death by abstraction.

emporas2y ago

I have read the whole source of LangChain in Rust (there are no docs anyway), and it definitely seems over-engineering. The central premise of the project, of complicated chains of prompts is not useful to many people, and not to me either.

On the other hand it took some years into the web, for some web frameworks to emerge and make sense, like Ruby on Rails. Maybe in 3-4 years time, complicated chains of commands to different A.I. engines will be so difficult to get right that a framework might make sense, and establish a set of conventions.

Agents, another central feature of LangChain, are not proved to be very useful as well, for the moment.

ttul2y ago

LangChain got its start before LLMs had robust conversational abilities and before the LLM providers had developer decent native APIs (heck, there was basically only OpenAI at that time). It was a bit DOA as a result. Even by last spring, I felt more comfortable just working with the OpenAI API than trying to learn LangChain’s particular way of doing things.

Kudos to the LangChain folks for building what they built. They deserve some recognition for that. But, yes, I don’t think it’s been particularly helpful for quite some time.

thefourthchime2y ago

I tried to use Langchain a couple times, but every time I did, I kept feeling like there was an incredible amount of abstraction and paradigms that were completely unnecessary for what I was doing.

I ended up calling the model myself and extracting things using a flexible json parser, I ended up doing what I needed with about 80 lines of code.

gazarullz2y ago

Which alternatives have you been introduced to?

leobg2y ago

This is their game. Infiltrate HN, X, YouTube, Google with “tutorials” and “case studies”. Basically re-target engineers until they’ve seen your name again and again. Then, they sell.

Langchain, Pinecone, it’s all the same playbook.

wg02y ago· 6 in thread

Sorry noob question - where can I read more about this "agents" paradigm? Is one agent's output directly calling/invoking another agent? Or there's already fixed graph of information flow with each agent (I presume some prompt presets/templates like "you are an expert this only respond in that") sorts of?

Also, how much success people have or had with automating the E2E tests for their various apps by stringing such agents together themselves

EDIT: Typos

zby2y ago

In practice this means function calling - the LLM chooses the function to call (and its parameters). Usually in a loop with a 'finish' function that returns the control to the outside code.

You can do that without function calling - as did the original ReAct paper - but then you have to write your own grammar for the communication with the LLM, a parser for it, and also you need to teach the LLM to use that grammar. This is very time consuming.

zEddSH2y ago

> Also, how much success people have or had with automating the E2E tests for their various apps by stringing such agents themselves together?

There’s a few startups in the space doing this like QA Tech in Stockholm, and others even in YC (but I forgot the name). I’m skeptical of how successful they’ll be, not just from complex test cases but things like data management and mistakingly affecting other tests. Interesting to follow just in case though, E2E is a pain!

CGamesPlay2y ago

Fundamentally, "Agent" refers to anything that operates in an "observe-act" loop. So in the context of LLMs, an agent sees an observation (like the code base and test output) and produces an action (like a patch), and repeats.

pavi24102y ago

I want to learn about agents too!

hcks2y ago

Don’t waste your time, it’s been around since GPT3, and had no results so far. Also notice how no frontier lab is working on it.

zby2y ago

Letting the LLM to decide what to do is a powerful technique. For example one pass RAG is very limited: https://zzbbyy.substack.com/p/why-iterative-thinking-is-cruc... To make it iterative you need the cede the control to the LLM.

etse2y ago· 5 in thread

My reading of the article is that because LangChain is abstracted poorly, frameworks should not be used, but that seems a bit far.

my experience is that Python has a frustrating developer experience for production services. So I would prefer a framework with better abstractions and a solid production language (performance and safety), over no framework and Python (if those were options)

lolinder2y ago

For most of what people are doing with AI you don't need Python because you don't need the ML ecosystem. You're either going to be talking to some provider's API (in which case there are wrappers aplenty and even if there weren't their APIs are simple and trivial to wrap yourself) or you're going to self-host a model somewhere, in which case you can use something like ollama to give yourself an easy API to code against.

All of the logic of stringing prompts and outputs together can easily happen in basically any programming language with maybe a tiny bespoke framework customized to your needs.

Calling these things "AI agents" makes them sound both cooler and more complicated than they actually are or need to be. It's all just taking the output from one black box and sticking it into the input of another, the same kind of work frontline programmers have been doing for decades.

ilaksh2y ago

They become agents when the LLM output is function calls.

1 more reply

Kostarrr2y ago

Disclamer: I work for Octomind.

I think the reading is more "It's hard to find a good abstraction in a field that has not settled yet on what a good abstraction is. In that case, you might want to avoid frameworks as things shift around too much."

autokad2y ago

prompt engineering requires the ability to see what is happening at various steps and langchain makes that harder if not impossible.

honestly I don't need that much abstraction.

int_19h2y ago

I think this is another crucial part. Right now, writing prompts is kinda like writing hand-crafted assembly back in the day where that was routine because there was simply no other way to get good results out of hardware in many cases - but also because the tasks that are actually doable do not require much code, so it's perfectly feasible to write it in assembly by hand.

LangChain is kinda like taking that state of hardware and bolting on a modern C++ compiler with templates and STL on it.

bastawhiz2y ago· 5 in thread

Genuine question: can someone point me to a use case where langchain makes the problem easier to solve than using the openai/anthropic/ollama SDKs directly? I've gotten a lot of advice to use langchain, but the docs haven't really shown me how it simplifies the task, or at least not more than using an SDK directly.

I really want to at least understand when to use this as a tool but so far I've been failing to figure it out. Some of the things that I tried applying it for:

- Doing a kind of function calling (or at least, implementing the schema validation) for non-gpt models

- parsing out code snippets from responses (and ignoring the rest of the output)

- Having the output of a prompt return as a simple enum without hallucinations

- process a piece of information in multiple steps, like a decision tree, to create structured output about some text (is this a directory listing or a document with content? What category is it? Is it NSFW? What is the reason for it being NSFW?)

Any resources are appreciated

starik362y ago

It makes it simple (and uniform) to switch providers.

localfirst2y ago

theres already solutions for this but even this i feel like is a wasted effort unless you have the token volume to justify high availability

bastawhiz2y ago

Is that really it?

mikeqq20242y ago

Mind tell what kind of scenario you are tring to solve?

bastawhiz2y ago

I literally just want to know what use cases langchain serves. I've built four or five different applications at this point, and it was easy to enough to use various SDKs. Where does langchain come in?

elbear2y ago· 3 in thread

It would have been great if the article provided a more realistic example.

The example they use is indeed more complex than the openai equivalent, but LangChain allows you to use several models from several providers.

Also, it's true that the override of the pipe character is unexpected. But it should make sense, if you're familiar with Linux/Unix. And I find it shows more clearly that you are constructing a pipeline:

    prompt | model | parser

bestcoder692y ago

I can already use multiple backends by writing different code. The value-add langchain would need to prove is whether i can get better results using their abstractions compared to me doing it manually. Every time I’ve looked at how langchain’s prompts are constructed, they went wayyy against LLM vendor guidance so I have doubts.

Also the downside of not being able to easily tweak prompts based on experiments (crucial!)

And not to mention the library doesn’t actually live up to this use case, and you immediately (IME) run into “you actually can’t use a _Chain with provider _ if you want to use their _ API”, so I ultimately did have to care about whats supposed to be abstracted over

elbear2y ago

Your comment gives better reasons than the article for not using LangChain.

drdaeman2y ago

Yeah, I was kind of surprised. The premise of the article started as "LangChain abstractions are off" and then the complaint was about... just a very simple pipeline?

I honestly don't care about the syntax (as long as it's sane enough), and `|` operator overloading isn't the worst one. Manually having to define a parser object gives off some enterprise Java vibes, and I get the httplib vs requests comparison - but it's not the end of the world. If anything, the example from the article left me wondering "why do they say it's worse, when at this level of abstraction it really looks better unless we don't ever need to customize the pipeline at all?" And they never gave any real example (about spawning those agents or something) that actually shows where the abstractions are making things hard or obscure.

Honestly, on the first reading, the article [wrongly] gave me an impression of saying "we don't use LangChain anymore because it lacks good opinionated defaults", which is surely wrong - it would be a very odd take, given the initial premise of using it production for a long while.

(I haven't used LangChain or any LLMs in production, just toyed around a little bit. I can absolutely agree with the article that if all you care about is one single backend, then all those abstractions are not likely to be a good idea.)

danielmarkbruce2y ago· 3 in thread

Yup. The problem with frameworks is they assume (historically mostly but not always correctly) that layers of abstraction mean one can forget about the layers below. This just doesn't work with LLMs. The systems are closer to biology or something.

nosefurhairdo2y ago

Very much depends on the framework. I'm currently building a GitHub App with the Probot framework, which mostly just handles authentication boilerplate and some testing niceties, then just gives you an authenticated GitHub API client (no facade/abstraction).

Then of course there's the many web application frameworks, because nobody in their right mind would want to implement http request parsing themselves (outside of academic exercises).

In fact, I would argue that most popular frameworks exist precisely because it's often more time efficient to forget about underlying details. All computer software is built on abstraction. The key is picking the right level of abstraction for your use case.

danielmarkbruce2y ago

Reread the thread and the comment. It's about the LLM frameworks and acknowledges that most non LLM frameworks historically are helpful and correct in abstracting away details.

1 more reply

randomdata2y ago

It often took quite a long time for those historic frameworks to get the abstraction right. Survivorship bias sees us forget all the failed attempts.

I'm unconvinced there is no room for a framework here because LLMs are somehow special. LangChain just missed the mark. Unsurprisingly so, it being an early attempt, not to mention predating general availability of the LLM chatbots that have come to define the landscape.

StrauXX2y ago· 3 in thread

I don't like langchain that much either. It's not as bad as LLAmaIndex and Haystack in regards to extreme overengineering and overabstracting but it still is bad. The reason I still use Langchain is that often times I need to be able to swap out LLM service providers, embedding models and so on for clients. Thats really the only part about langchain that really works well.

Btw. you don't have to actually chain langchain entities. You can use all of them directly. That makes the magic framework code issue much more tolerably as Langchain turns from a framework into a library.

freezed82y ago

(jerry here from llamaindex)

wait do you have specific examples of "overengineering and overabstracting" from llamaindex? very open to feedback and suggestions on improvement - we've spent a lot of work making sure everything is customizable

darkteflon2y ago

I don’t think that’s fair to Llama-Index. LI is much more focused, better documented and frankly way easier to use than LangChain, with much lower - or negligible - cognitive overhead. Plus, it plays nice with most everything else.

Even if you’re mostly working just with a provider SDK and other lightweight, low-dependency convenience wrappers for stuff you know you’ll almost always need (e.g. Instructor for structured output and retry), you can easily sprinkle LI in where you need it as a wrapper over common context retrieval patterns.

Unlike LangChain, which is a nightmare to pull out once you’ve started working with it - LI can be cleanly excised if you change your mind.

sramam2y ago

Have you considered LiteLLM?

elijahbenizzy2y ago· 2 in thread

I really like the idea of "good" and "bad" abstractions. I have absolutely built both.

This sentiment is echoed in this comment in reddit comment as well: https://www.reddit.com/r/LocalLLaMA/comments/1d4p1t6/comment....

Similarly to this post, I think that the "good" abstractions handle application logic (telemetry, state management, common complexity), and the "bad" abstractions make things abstract away tasks that you really need insight into.

This has been a big part of our philosophy on Burr (https://github.com/dagworks-inc/burr), and basically everything we build -- we never want to tell how people should interact with LLMs, rather solve the common problems. Still learning about what makes a good/bad abstraction in this space -- people really quickly reach for something like langchain then get sick of abstractions right after that and build their own stuff.

laborcontract2y ago

    > the "bad" abstractions make things abstract away tasks that you really need insight into.

Yup. People say to use langchain to prototype stuff before it goes into production but I find it falls flat there. The documentation is horrible and they explain absolutely zero about the methods they use, so the only way to “learn” is by reading their spaghetti code.

elijahbenizzy2y ago

Agreed — also I’m generally against prototyping stuff and then entirely rewriting it for production as the default approach. It’s a nice idea but nobody ever actually rewrites it (or they do and it’s exceedingly painful). In true research it makes sense, but very little of what engineers do falls under that category.

Instead, it’s either “welp, pushed this to prod and got promoted and it’s someone else’s problem” or “sorry, this valuable thing is too complex to do right but this cool demo got me promoted...”

captaincaveman2y ago· 2 in thread

I think LangChain basically tried to do a land grab, insert itself between developers and LLM's. But it didn't add significant value and seemed to dress it up by adding abstractions that didn't really make sense. It was that abstraction gobbledygook smell that made me cautious.

iknownthing2y ago

Looks like they've parlayed it into some kind of business https://www.langchain.com/

bestcoder692y ago

They’ve been growth hacking the whole time pretty much, optimizing for virality. Eg integrating with every ai thing under the sun, so they could publish a seo-friendly “use gpt3 with someVecDb and lang chain” page, but for every permutation you can think. Easy for them to write since langchains abstractions are just unnecessary wrappers. They’ve also had meetups since very early on. The design seems to make langchain hard to remove since you’re no longer doing functional composition like you’d do in normal python - you’re combining Chains. You can’t insert your own log statements in between their calls so you have to onboard to langsmith for observability (their saas play). Now they have a DSL with their own binary operators :[

VC-backed, if you couldn’t guess already

dcole29292y ago· 2 in thread

I've seen a lot of stuff recently about how LangChain and other frameworks for AI/LLM are terrible and we shouldn't use them and I can't help but think that people are missing the point. If you need strong customization or flexibility frameworks of any kind are almost always the wrong choice, whether you're building a website or an AI agent. That's kind of the whole point of a framework. Opinionated workflows that enable a specific kind of application. Ideally the goal is to cover 80% of the cases and provide escape hatches to handle the other 20% until you can successfully cover those too.

As someone new to the space I have zero opinions of whether LangChain is better than writing it all yourself, but I can certainly say that, I at least, appreciate having a proscribed way of doing things, and I'm okay with the idea that I may get to a place where it no longer serves my needs. It's also worth noting that the benefit of LangChain is the ability to "chain" together these various AI links. Is there a better easier way to do that? Probably, but LangChain removes that overhead.

ilaksh2y ago

I think that yes, there is a better way. You have a function that calls the API, then take the output and call another function that calls the API, inserting the first output into the second one's prompt using an f-string or whatever. You can have a helper function that has defaults for model params or something.

You don't need an abstraction at all really. Inserting the previous output into the new prompt is one line of code, and calling the API is another line of code.

If you really feel like you need to abstract that then you can make an additional helper function. But often you want to do different things at each stage so that doesn't really help.

riwsky2y ago

As the article points out, the difference between frameworks for building a website vs building an LLM agent is that we have decades more industrial experience behind our website-building opinions. I’ve used heavyweight frameworks before, and would understand your defense in the context of eg complaints about Spring Boot—but Langchain isn’t Spring; it really does kinda suck, for reasons that go beyond the inherent trade offs of using any framework.

djohnston2y ago· 2 in thread

Idk, dude spends the post whining about writing multi agent architecture and doesn’t mention langgraph once. Reads like a lead who failed to read the docs.

2C642y ago

LangGraph is the primary reason I use LangChain - being able to express my flow as a state machine has been a boon to both the design of my platform as well as my own productivity.

esafak2y ago

How does langgraph stack up against the alternatives?

matusp2y ago· 1 in thread

This echoes our experience with LangChain, although we have abandoned it before putting it into production. We found out that for simple use cases it's too complex (as mentioned in the blog), and for complex use cases it's too difficult to adapt. We were not able to identify what is the sweet spot when it is worth it to use it. We felt like we can easily code ourselves most of its functionality very quickly and in a way that fits our requirements.

localfirst2y ago

i've never seen a HN thread where everybody just unanimously agrees and wow I definitely will not be recommending Langchain or using it personally after reading through all the horror stories.

seems like another case of creating busysoftware. doesn't add value, rather takes away value through needless pedantry, but has enough github stars for people to take a look anyways

deckar012y ago· 1 in thread

I recently unwrapped linktransformer to get access to some intermediate calculations and realized it was a pretty thin wrapper around SentenceTransformer and DBScan. It would have taken me so much longer to get similar results without copying their defaults and IO flow. It’s easy to take for granted code you didn’t have to develop from scratch. It would be interesting if there was a tool that inlined dependency calls and shook out unvisited branches automatically.

luke-stanley2y ago

From memory, I recall Vulture might do something like that!

zby2y ago· 1 in thread

I am always suspicious with frameworks. There are two reasons of that. First is that because of the inversion of control they are more rigid than libraries. This is quite fundamental - but there are cases where the trade off is totally worth it. The second one is because of how they are created - it often starts with an application which is then gradually made generic. This is good for advertising - you can always show how useful the framework with an application that uses it. But this "making it generic" is a very tricky process that often fails. It is a top down, the authors need to imagine possible uses and then enable them in the framework - while with libraries the users have much more freedom to discover them in a bottom up process. Users always have surprising ideas.

There are now libraries that cover some of the features of Langchain. There is Instructor and mine LLMEasyTools for function calling, there is LiteLLM for API unification.

mike9862y ago

from the first glance of the example, it seems like the 1st use case is invoking a function (selected by chatgpt) and the 2nd use case is similar to instructor.

can you comment how your library differs from instructor (what yours can do that instructor can't and vice versa?)

thanks

nosefrog2y ago· 1 in thread

Anyone who has read LangChain's code would know better than to depend on it.

whydid2y ago

A heuristic that I use when judging code quality is a search for "datas" or "metadatas".

clarionbell2y ago· 1 in thread

LangChain approach struck me as interesting, but I never really saw much inherent utility in it. For our production code we went with direct use of LLM runtime libraries and it was more than enough.

randomdata2y ago

We've had success with non-developers using some of the visual tools built on top of LangChain to build exploratory models in order to prove a concept. LangChain does seem well suited to providing the "backend" for that type of visual node-based modelling.

Of course, once the model is proven it is handed off to developers to build something more production-worthy.

jsemrau2y ago· 1 in thread

LCEL is such a weird paradigm that I never got the hang of. Why | use | pipes?

elbear2y ago

I found it weird as well to see that. I didn't know LangChain overrode Python syntax.

But, if you're familiar with Linux/Unix, this should be familiar. You are piping the output of one function as the input of another function.

isaacphi1y ago

I had the same impression after working through the LangChain tutorials. The one thing I'd like to ask about is Observability. LangChain has some tools around observability that seem genuinely useful to me, and specific to working with LLMs. Are there ways to use only these tools, or alternative observability tools you recommend for working with LLMs?

Kydlaw2y ago

IMO LangChain provides very high level abstractions that are very useful for prototyping. It allows you to abstract away components while you dig deeper on some parts that will deliver actual value.

But aside from that, I don't think I would run it in production. If something breaks, I feel like we would be in a world of pain to get things back up and running. I am glad they shared their experience on that, this is an interesting data point.

andrewfromx2y ago

"When abstractions do more harm than good" I'll take this for $2000 please and if i get the daily double, bet it all.

iknownthing2y ago

I tried LangChain a while ago for a RAG project. I liked how I could just plug into different vector stores to try them out. But I didn't understand the need for the abstractions around the API calls. It's not that hard to just call these APIs directly and its not that hard to create whatever prompt you'd like.

Turskarama2y ago

This is so common I think it could just about be a lemma:

Any tool that that helps you to get up and running quicker by abstracting away boilerplate will eventually get in the way as your projects complexity increases.

fragebogen2y ago

I'd challenge some of these criticisms and give my 2c on this. I've spent the last 6 months working on a rather complex chat with routes, agents, bells and whistles sort of system. Initially, time to POC was short, so I picked it to get quick at my feet. Eventually, I thought. The code base isn't enormous, I can easily rewrite it, but I'd like to see what people mean with "abstraction limiting progress" kind of statements. I've now kept building this project for another 6 months and I must say the more I work with it and understand its philosophy.

It's not that complicated. The philosophy is just different from many other python projects. The LCEL pipes for example is a really nice way to think of modularity. Want to switch out one model for another? Well just import another model and replace the old. Want to parse it more strictly, exchange the parser. The fact that everything is an instance of `RunnableSerializable` is a really convenient way of making things truly modular. Want to test your pipe syncronously? Easy just use `.stream()` instead of `.astream()` and get on with it.

I think my biggest hurdle was understanding how to debug and pipe components, but once I got familiarized with it, I must say it made me grow as a python dev and appreciate the structure and thought behind it. Where complexity arise is when you have a multi-step setup, some sync and some async. I've had to break some of these steps up in code, but otherwise it gives me tons of flexibility to pick and chose components.

My only real complaint would be lack of documentation and outdated documentation, I'm hardly the only one, but it really is frustrating sometimes to understand what some niche module can and cannot do.

maximilianburke2y ago

I just pulled out LangChain from our AI agents; we now have much smaller docker images and the code is a lot easier to understand.

whitej1252y ago

I used LangChain early on in it's life. People crap on their documentation but at least at that point in time I had no problem with it. I like reading source code so I'd find myself reading the code for further comprehension anyway. In my case - I'm a seasoned engineer who was discovering LLMs and thought LangChain suited that way of learning pretty well.

When it came to building anything real beyond toy examples, I quickly outgrew it and haven't looked back. We don't use any LC in production. So while LC does get a lot of hate from time to time (as you see in a lot of peers posts here) I do owe them some credit for helping bridge my learning of this domain.

monarchwadia2y ago

I'm the author of Ragged, a lightweight connector that makes it easy to connect to and work wth language models. Think about it like an ORM for LLMs --- a unified interface designed to make it easy to work with LLMs. Just wanted to plug my framework in case people are looking for an alternative to building their own connector components.

https://monarchwadia.medium.com/use-openai-in-your-javascrip...

czechdeveloper2y ago

I used langchain in one project and I do regret choosing it over just writing everything over direct API. I feel their pain.

It had advantage of having standardized API, so I could switch local LLM to OpenAI and just compare results in a heartbeat, but when I wanted anything out of ordinary (ie. get logprobs), there was just no way.

cyanydeez2y ago

In some sense, this could be retitled "We no longer use training wheels on our bikes"

codelion2y ago

Many such cases. It is very hard to balance composition and abstraction in such frameworks and libraries. And LLMs being so new it has taken several iterations to get the right patterns and architecture while building LLM based apps. With patchwork (https://github.com/patched-codes/patchwork) an open-source framework for automating development workflows we try hard to avoid it by not abstracting unless we see some client usage. As a result you do see some workflows appear longer with many steps but it makes it easier to compose them.

d4rkp4ttern2y ago

Frustration with LangChain is what led us (ex-CMU/UW-Madison researchers) to start building Langroid[1], a multi-agent LLM framework. We have been thoughtful about designing the right primitives and abstractions to enable a simple developer experience while supporting sophisticated workflows using single or multiple agents. There is an underlying loop-based orchestration mechanism that handles user interaction, tool handling and inter-agent handoff/communication.

We have companies using Langroid in production.

[1] Langroid: https://github.com/langroid/langroid

wouldbecouldbe2y ago

Everyone in my office is talking about ai agents as a magic bullet, driving me crazy

dmezzetti2y ago

An alternative is using txtai (https://github.com/neuml/txtai). It's lightweight and works with both local and remote LLMs.

Here is an example article that shows how to use OpenAI calls with txtai: https://neuml.hashnode.dev/rag-with-llamacpp-and-external-ap...

bratbag2y ago

I made the same choice for our stack last year.

We initially had problems diagnosing issues inside LangChain and were hitting weird issues with some elements of function calling, so we experimented with a manual reconstruction of exactly what we needed and it was faster, more resilient and easier to maintain.

I can see how switching models might be easier using LangChain as an abstraction layer, but that doesn't justify making everything else harder.

Oras2y ago

The comments are good example that hype > quality.

99% of docs mentioning LangChain or showing a code example with LangChain. Wherever you look at tutorials or YouTube videos, you will see LangChain.

They take the credit of being the first framework to abstract LLM calls and other features such as reading data from multiple sources (before function calling was a thing).

Langchain was first, got popular, and hence for new comers they think it’s the way, until they use it.

jostmey2y ago

Learning LangChain is effort, but not as much as truly understanding deep learning, so you learn LangChain and it feels like progress, when it may not be

gravenate2y ago

Hard Agree, Semantic Kernal, On the other hand seems to actually be a value add on top of the simple API calls. Have you guys tried it ?

andix2y ago

Are there better abstractions? I wanted to look into Microsoft's Semantic Kernel, which seems to be a direct competitor of LangChain. Are there any other options?

https://learn.microsoft.com/en-us/semantic-kernel/overview

__loam2y ago

Langchain has always been open source and has always sucked. I'm shocked anyone still uses it when you can see it for yourself.

resource_waste2y ago

LangChain tutorials be like:

Go to foo_website and put your credit card to get their API. Then go to bar_website, get their API. Then go to yayeee_website and get their API. Then go to...

But unironically.

I actually counted 4 APIs in some 'how to' article. I ended up DIYing that with 0 APIs.

Whoever got into langchain planted their APIs. That is why it sucks.

sabrina_ramonov2y ago

You used langchain for a simple replacement of OpenAI API calls — of course it will increase complexity for no benefit.

The benefits of langchain are: (1) unified abstraction across multiple different models and (2) being able to plug this coherently into one architecture.

If you’re just calling some OpenAI endpoints, then why use it in the first place?

zackproser2y ago

Here's a real world example of a custom RAG pipeline built with Langchain

https://zackproser.com/chat

I did a full tutorial with source code that's linked at the top of that page ^

Fwiw I think it's a good idea to build with and without Langchain for deeper understanding.

sandGorgon2y ago

shameless plug - i build a JS/TS framework which tries to solve the abstraction problem. we use a json variant called jsonnet (created at google. expressive enough for kubernetes).

https://github.com/arakoodev/EdgeChains/tree/ts/JS/edgechain...

examples of these jsonnet for react COT chains - https://github.com/arakoodev/EdgeChains/blob/ts/JS/edgechain...

P.S. we also build a webassembly compiler that compiles this down to wasm and deploy on hardware.

createaccount992y ago

A lot of competition in the field, and just about all of them (llamaindex/autogpt/langchain/others?) appear as "build sdk, build saas on top" type of products.

Curious thing, but I'd rather not partake myself.

greo2y ago

I am not a fan of LangChain. And I would never use it for any of my projects.

LLM is already a probabilistic component that is tricky to integrate into a solid deterministic system. An abstraction wrapper that bloats the already fuzzy component just increases the complexity for no apparent benefit.

te_chris2y ago

The thing that blows my mind is that this wasn’t obvious to them when they first looked at langchain

spullara2y ago

every good developer i know that has started using langchain stopped after realizing that they need more control than it provides. if you actually look at what is going on under the hood by looking at the requests you would probably stop using it as well.

nprateem2y ago

Wasn't it obviously pointless from the outset? Posts like this raise questions about the technical decisions of the company more than anything else IMO. Strange they'd want to publicise making such poor decisions.

seany622y ago

Glad to see I'm not the only one experiencing this. The agents framework I use is moving very fast and its not uncommon for even minor versions to break my current setup

cyounkins2y ago

Is there a lighter weight solution that abstracts the interfaces so I can swap GPT4 with Claude, including function calling?

mark_l_watson2y ago

I was an early enthusiast of both LangChain and LlamaIndex (and I wrote a book using both frameworks, free to read online [1]) but I had some second thoughts when I started when I started writing LLM examples for my Common Lisp and Racket books that were framework-free, even writing simple vector data stores from scratch. This was, frankly, more fun.

For my personal LLM hacking in Python, I am starting down the same path: writing simple vector data stores in NumPy, write my own prompting tools and LLM wrappers, etc.

I still think that for many developers LangChain and LlamaIndex are very useful (and I try to keep my book up to date), but I usually write about things of most interest to me and I have been thinking of rewriting a new book on framework-free LLM development.

[1] https://leanpub.com/langchain/read

Havoc2y ago

There was a Reddit thread in langchain sub a while back basically saying exactly this (plus same comments as here)

ZiiS2y ago

The "good abstraction" has a bug; slightly undermines the argument.

_pdp_2y ago

We also built our own system that caters for our customers' needs.

hcks2y ago

LangChain is a critical thinking test and orgs using it are ngmi

JSDevOps2y ago

The dude on that blog is trying way too hard to look like Sam Altman which is fucking weird.

gexaha2y ago

that's a nice AI image with octopi

xyst2y ago

Never been a fan of ORM for databases. So why would that change with AI/LLM “prompt engineering”? Author confirms my point.

ricklamers2y ago

FWIW I think LangChain has evolved a lot and is a nice time saver once you figure out the patterns it uses. The LangSmith observability is frankly fantastic to quickly get a sense of how your expected LLM flow engineering ends up working out in practice. So much FUD here, unwarranted IMO. Don’t forget, reading code is harder than writing it, doesn’t warrant throwing out the baby with the bath water. Don’t fall for NIH :) Haven’t had issues running in prod recently either since they’ve matured their packaging with core/community/partner etc. For agentic use cases look at LangGraph for a cleaner set of primitives that give you the amount of control needed there.

j / k navigate · click thread line to collapse

297 comments

216 comments · 66 top-level

sc077y2y ago· 23 in thread

This is honestly such a boost of confidence.

w42y ago

danenania2y ago

1 more reply

jackmpcollins2y ago

[0] https://github.com/jackmpcollins/magentic

hobs2y ago

jacobsimon2y ago

I admire what the Langchain team has been building toward even if people don’t agree with some of their design choices.

The OpenAI api and others are quite raw, and it’s hard as a developer to resist building abstractions on top of it.

Coming from the JS ecosystem, I imagine a lot of people would like a lighter weight library like Express that handles the boring parts but doesn’t get in the way.

siva72y ago

ramoz2y ago

Wise perspective from an intern. The type of pragmatism we love.

weakfish2y ago

I wish I was this pragmatic as an intern.

ianschmitz2y ago

Way to follow your instinct.

tkellogg2y ago

I've had the same experience. I thought I was the weird one, but, my god, LangChain isn't usable beyond demos. It feels like even proper logging is pushing it beyond it's capabilities.

felixfbecker2y ago

paraph1n2y ago

Could someone point me towards a good resource for learning how to build a RAG app without llangchain or llamaindex? It's hard to find good information.

turnsout2y ago

At a fundamental level, all you need to know is:

- Read in the user's input

- Use that to retrieve data that could be useful to an LLM (typically by doing a pretty basic vector search)

- Stuff that data into the prompt (literally insert it at the beginning of the prompt)

- Add a few lines to the prompt that state "hey, there's some data above. Use it if you can."

kolinko2y ago

You can start by reading up about how embeddings work, then check out specific rag techniques that people discovered. Not much else is needed really.

krawczstef2y ago

Here's a blog post that I just pushed that doesn't use them at all - https://blog.dagworks.io/p/building-a-conversational-graphdb (we have more on our blog - search for RAG).

[disclaimer I created Hamilton & Burr - both whitebox frameworks] See https://www.reddit.com/r/LocalLLaMA/comments/1d4p1t6/comment... for comment about Burr.

verdverm2y ago

My strategy has been to implement in / follow along with llamaindex, dig into the details, and then implement that in a less abstracted, easily understandable codebase / workflow.

Was driven to do so because it was not as easy as I'd like to override a prompt. You can see how they construct various prompts for the agents, it's pretty basic text/template kind of stuff

d132y ago

This is fun and interesting:

https://developers.cloudflare.com/workers-ai/tutorials/build...

sveinek2y ago

Data centric on YouTube has some great videos . https://youtube.com/@data-centric?si=EOdFjXQ4uv02J774

fsndz2y ago

check this: https://www.lycee.ai/blog/rag-fastapi-postgresql-pgvector

bestcoder692y ago

openai cookbook! Instructor is a decent library that can help with the annoying parts without abstracting the whole api call - see it’s docs for RAG examples.

puppymaster2y ago

joseferben2y ago

impressive to decide against something as shiny as langchain as intern

moneywoes2y ago

Any tutorials you follow?

fforflo2y ago· 19 in thread

LLM frameworks like LangChain are causing a java-fication or Python .

Do you want a banana? You should first create the universe and the jungle and use dependency injection to provide every tree one at a time, then create the monkey that will grab and eat the banana.

turbocon2y ago

Id just like to point out the source of the Gorilla Banana problem is Joe Armstrong. He really had an amazing way of explain complex problems in a simple way.

https://www.johndcook.com/blog/2011/07/19/you-wanted-banana/

fforflo2y ago

blackkettle2y ago

Figuring out how to customize something in a project like LangChain is positively Byzantine.

andix2y ago

fforflo2y ago

Well it's not:D Sure there are 4-5 fundamental classes in python libs but they're just fundamental ones. They don't impose an OOP approach all the way.

What you're alluding to is people coming from Java to Python in 2010+ and having a use-classes-for-everything approach.

sabbaticaldev2y ago

I’ll use this to explain why typescript is bad

tills132y ago

Bad TypeScript is a PEBCAK.

Idiomatic and maintainable TypeScipt is no worse than vanilla JavaScript.

zarathustreal2y ago

Wait how does this relate to TypeScript?

1 more reply

tootie2y ago

visarga2y ago

Reasons for using Python: it is easier to find code on github for reuse and tweaking, most novel research publishes in PyTorch, there is a significant network effect if you follow cutting edge.

Llama.cpp was only possible after the neural architecture stabilized and they could focus on a narrow subset of basic functions needed by LLMs for inference.

wnmurphy2y ago

I still find LC really useful if you stick to the core abstractions. That tends to minimize the dependency issues.

spywaregorilla2y ago

I feel like most of this complaint is about OOP, not java.

marginalia_nu2y ago

It's a reasonably valid comparison if you equate Java with something like SpringBoot.

fforflo2y ago

OOP is Java, and Java is OOP, right?

My point is to follow a dogmatic OOP approach (think all the nouns like Agent, Prompt, etc.) to model something rather sequential.

3 more replies

pacavaca2y ago

Oh my! I've been looking for this comment Will be using it in the future to explain my feelings about Java and Python

9dev2y ago

LangChain might not be the answer, but having no standard tools at all isn't either.

dartos2y ago

Sounds like your LLM guy just isn’t very good.

Langchain is, when you boil it down, an abstraction over text concatenation, staged calls to open ai, and calls to vector search libraries.

Even without standard tooling, an experienced programmer should be able to write an understandable system that does those things.

2 more replies

fforflo2y ago

I'll bite:

The problem is that the only "LLM guy." I could trust with such a description, someone who has co-authored a substantial paper or has hands-on training experience in real big shops.

For the foreseeable future though, I think most projects will have to rely on great software engineers with experience with different LLMs and a solid understanding of how these models work.

beeboobaa32y ago

muzani2y ago· 15 in thread

Langchain was released in October 2022. ChatGPT was released in November 2022.

fnordpiglet2y ago

muzani2y ago

This feels like a valid use for langchain then. Thanks for sharing.

Which models do you use and for what use cases? 1000x is quite a lot of savings; normally even with fine-tuning it's at most 3x cheaper. Any cheaper we'd need to get like $100k of hardware.

pietro72ohboy2y ago

chewxy2y ago

Dana Angluin's group were studying chat systems way back in 1992. There even was a conference around conversational AI back then.

1 more reply

baobabKoodaa2y ago

shpx2y ago

People call the first actually useful thing the first thing, that's not surprising or wrong.

1 more reply

netdevnet2y ago

Chat GPT is just GPT version 3.5. OpenAI released many other versions of GPT before that. In fact, Open AI became really popular around the time of the GPT 2 which was a fairly good chat model.

Also, the Transformer architecture was not created by OpenAI so LLMs were a thing way before OpenAI existed :)

moffkalast2y ago

2 more replies

muzani2y ago

The point isn't the models but the structure. Let's say you wanted AI to compare Phone 1 and Phone 2.

GPT-3 was originally a completion model. Meaning you'd say something like

    Here are the specifications of 3 different phones: (dump specs here)

    Here is a summary.

    Phone 0
    pros: cheap, tough, long battery life.
    cons: ugly, low resolution.

    Phone 1
    pros:

And then GPT would fill it out. Phone 0 didn't matter, it was just there to get GPT in the mood.

But the problem with these is you did a thing and it was done. Let's say you wanted to do something else with this information.

bestcoder692y ago

They released chat and non-chat (completion) versions of 3.5 at the same time so not really; the switch to chat model was orthogonal.

e: actually some of the pre-chatgpt models like code-davinci may have been considered part of the 3.5 series too

isaacfung2y ago

muzani2y ago

Was RAG popular on release? Google Trends indicates it started appearing around April 2023.

1 more reply

weinzierl2y ago

I too wondered about "by "turn these one-shot APIs into Markov chains.".

kgeist2y ago

>Chat models worked great for everything, including what we used instruct & completion models for

avereveard2y ago

gpt4: "I've ten book and I read three, how many book I have?" "You have 7 books left to read. " and

2 more replies

infecto2y ago· 13 in thread

It was interesting as a library at the very beginning to see how people were thinking about patterns but pretty useless in production.

chatmasta2y ago

It was the first pass at solving the common problems when building with LLMs. People jumped on it because it was trendy and popular.

But it quickly became obvious that LangChain would be better named LangSpaghetti.

dongobread2y ago

1 more reply

__loam2y ago

Good thing they didn't raise money to develop this piece of crap.

https://blog.langchain.dev/announcing-our-10m-seed-round-led...

jhoechtl2y ago

Doesn't langchain provide useful functionality when it comes to RAG? Here it seems it does considerably more but being a mere shim abstraction?

1 more reply

ravenstine2y ago

langcss2y ago

ORMs are useful though for a different reason. They let you creat typed objects then generate the schema from them and automatically create a lot of boilerplate SQL for you.

Admittedly for anything more than 1-2 joins you are better off hand crafting the SQL. But that is the exception not the rule.

Refactoring DB changes becomes easier, you have a history of migrations for free, DDL generation for free.

1 more reply

gavmor2y ago

> to fetishize certain design patterns

Yes; exactly. There's value in a Schelling Point[0], and in a pattern language[1].

> requires literally none

True, yes. There isn't infinite value in these things, and "duplication is far cheaper than the wrong abstraction"[2], but they can't be avoided; they occupy local maxima.

0. https://en.wikipedia.org/wiki/Focal_point_(game_theory)

1. https://en.wikipedia.org/wiki/Pattern_language

2. https://sandimetz.com/blog/2016/1/20/the-wrong-abstraction

choilive2y ago

refulgentis2y ago

causal2y ago

Yeah I thought the consensus against LangChain was formed a year ago, surprised to still be seeing these articles.

1 more reply

richrichie2y ago

Langchain seems to have been made just for the tutorial business on Udemy and Youtube.

cyberdrunk22y ago

I think it was great at first when llms were new and prompting required more strategy. Now the amount of of abstractions/ bloat they have for essentially string wrappers makes no sense

justanotheratom2y ago

never understood the "chain" in langchain.

hwchase172y ago· 10 in thread

Hi HN, Harrison (CEO/co-founder of LangChain) here, wanted to chime in briefly

> But frameworks are typically designed for enforcing structure based on well-established patterns of usage - something LLM-powered applications don’t yet have.

I think this is the key point. I agree with their sentiment that frameworks are useful when there are clear patterns. I also agree that it is super early on and super fast moving field.

Again, this is probably a longer discussion but I just wanted to share some of the directions we're taking to address some of the valid criticisms here. Happy to answer any questions!

jfjeschke2y ago

Different agents for different situations...

hwchase172y ago

> Different agents for different situations...

1 more reply

cynicalpeace2y ago

LangChain had a time and place. That was Spring of 2023, when everyone was figuring out how to string together llm calls with function calls.

We've figured that out, and the answer (like usual) is just K.I.S.S., not LangChain.

It seems even the LangChain folks are abandoning it. Good on you, you will most likely succeed if you do.

fswd2y ago

hwchase172y ago

ooc, what was the "same single solution"

1 more reply

causal2y ago

I appreciate that you're taking feedback seriously, and it sounds like you're making some good changes.

But if you're sure it's in a much better place now, then for marketing purposes you might be better off relaunching as LangChain2, intentionally distancing the project from earlier versions.

hwchase172y ago

sorry to hear that, totally understand feeling burnt

ooc - do you think theres anything we could do to change that? that is one of the biggest things we are wrestling with. (aside from completely distancing from langchain project)

2 more replies

ctxc2y ago

So the playing field has and is changing, langChain are adapting.

Isn't that a bit too extreme? Goodwill burnt up? When the field changes, there will be new abstractions - of course I'll have to understand them to decide for myself if they're optimal or not.

React has an abstraction. Svelte has something different. AlpineJS, another. Vanilla JS has none. Does that mean only one is right and the remaining are wrong?

I'd just understand them and pick what seems right for my usecase.

2 more replies

jes51992y ago

[deleted]

causal2y ago

I also have my criticisms of LangChain, but this feels mean-spirited towards devs that I think are honestly trying and didn't charge anything to use.

1 more reply

CharlieDigital2y ago· 10 in thread

Bigger problem might be using agents in the first place.

It's easier to debug, monitor, and control the output flow this way.

[0] https://github.com/microsoft/semantic-kernel

Kiro2y ago

What's the difference? I thought "agents" was just a fancier word for sequential prompts.

CharlieDigital2y ago

ec1096852y ago

Some folks try to orchestrate the whole operation by a higher level prompt that essentially uses function calls to more specific prompts.

Versus just using the LLM’s for specific tasks and heuristics / own code for the orchestration.

But I agree there is a lot of anthropomorphizing that over states current model capabilities and just confuses things in general.

refulgentis2y ago

It's also used to mean "characters interacting with each other" and sort of message passing between them. Not sure but I get the sense thats what the author is using it as

isaacfung2y ago

mstipetic2y ago

Sequential prompts with an occasional cron job

ilaksh2y ago

"Agent" means that it outputs JSON with a function call name and parameters which you execute and usually then feed the results back to the LLM.

huevosabio2y ago

What does semantic kernel do for you? It isn't immediately obvious from the Readme.

CharlieDigital2y ago

SK does a lot of the same things that Langhain does at a high level.

It has other useful abstractions and you can see the full list of examples here:

- C#: https://github.com/microsoft/semantic-kernel/tree/main/dotne...

- python: https://github.com/microsoft/semantic-kernel/tree/main/pytho...

---

[0] https://github.com/microsoft/semantic-kernel/blob/main/dotne...

[1] https://github.com/microsoft/semantic-kernel/blob/main/dotne...

[2] https://github.com/microsoft/semantic-kernel/blob/main/dotne...

whoknowsidont2y ago

I'm not OP, but it's just C#/.NET glue and "sample" code for Azure, OpenAI, and a few others (if I were to generously describe it).

It doesn't actually "do" anything or provide useful concepts. I wouldn't use it for anything, personally, even to read.

Treesrule142y ago· 8 in thread

Has anyone else found a good way to swap out models between companies, Langchain has made it very easy for us to swap between openai/anthropic etc

riwsky2y ago

1 more reply

pveierland2y ago

Using Llama Index for this via the `llama_index.core.base.llms.base.BaseLLM` interface. Using config files to describe the args to different models makes swapping models literally as easy as:

  chat_model:
    cls: llama_index.llms.openai.OpenAI
    kwargs:
      model: gpt-4

  chat_model:
    cls: llama_index.llms.gemini.Gemini
    kwargs:
      model_name: models/gemini-pro

me_vinayakakv2y ago

Vercel AI SDK[1] shines in this aspect in JS ecosystem.

They have the concept of providers [2] and switching between them is easy as changing parameters of a function[3]

[1]:https://sdk.vercel.ai/docs/introduction

[2]: https://sdk.vercel.ai/docs/foundations/providers-and-models

[3]: https://sdk.vercel.ai/docs/ai-sdk-core/overview#ai-sdk-core

spdustin2y ago

LiteLLM.

https://www.litellm.ai/

2 more replies

ilaksh2y ago

Havoc2y ago

Use openrouter. One OpenAI like api but lots of models

nosefurhairdo2y ago

The strategy design pattern would be suitable for this.

skeledrew2y ago

Openrouter maybe?

altdataseller2y ago· 7 in thread

OutOfHere2y ago

Langchain has no such benefit.

mirekrusin2y ago

SQL has sophisticated WHERE clause support, GraphQL doesn't. It should be called GraphPickL.

andybak2y ago

Surely SQL is an API? The line between language and API is fairly blurry.

1 more reply

wizzwizz42y ago

> if SQL were directly an API

Isn't that what SQL/CLI is for? https://publications.opengroup.org/c451

1 more reply

ecjhdnc20252y ago

I don't know a thing about LangChain so this is a real digression, but I often wonder if people who are critiquing GraphQL do so from the position of only having written GraphQL resolvers by hand.

If so, it would make sense. Because that's not a whole lot of fun. But a GraphQL server-side that is based around the GraphQL Schema Language is another matter entirely.

I've written several applications that started out as proofs of concept and have evolved into production platforms based on this pairing:

https://lighthouse-php.com https://lighthouse-php-auth.com

I have written resolvers (for non-database types) and I don't personally use the automatic mutations; it's better to write those by hand (and no different, really, to writing a POST handler).

The rest is an enormous amount of code-not-written, described in a set of files that look much like documentation and can be commented as such.

ahzhou2y ago

GraphQL is very powerful when combined with Relay. It’s useless extra bloat if you just use it like REST.

The difference between the two technologies is that LangChain was developed and funded before anyone know what to do with LLMs and GraphQL was internal tooling using to solve a real problem at Meta.

In a lot of ways, LangChain is a poor abstraction because the layer it’s abstracting was (and still is) in it’s infancy.

nosefurhairdo2y ago

Evaluating technology based on its "cool kid usage" and a vague sense of complexity is likely not the best strategy. Perhaps instead you could ask "what problems does this solve/create?"

geuis2y ago· 6 in thread

SCUSKU2y ago

emporas2y ago

Agents, another central feature of LangChain, are not proved to be very useful as well, for the moment.

ttul2y ago

Kudos to the LangChain folks for building what they built. They deserve some recognition for that. But, yes, I don’t think it’s been particularly helpful for quite some time.

thefourthchime2y ago

I tried to use Langchain a couple times, but every time I did, I kept feeling like there was an incredible amount of abstraction and paradigms that were completely unnecessary for what I was doing.

I ended up calling the model myself and extracting things using a flexible json parser, I ended up doing what I needed with about 80 lines of code.

gazarullz2y ago

Which alternatives have you been introduced to?

leobg2y ago

This is their game. Infiltrate HN, X, YouTube, Google with “tutorials” and “case studies”. Basically re-target engineers until they’ve seen your name again and again. Then, they sell.

Langchain, Pinecone, it’s all the same playbook.

wg02y ago· 6 in thread

Also, how much success people have or had with automating the E2E tests for their various apps by stringing such agents together themselves

EDIT: Typos

zby2y ago

In practice this means function calling - the LLM chooses the function to call (and its parameters). Usually in a loop with a 'finish' function that returns the control to the outside code.

zEddSH2y ago

> Also, how much success people have or had with automating the E2E tests for their various apps by stringing such agents themselves together?

CGamesPlay2y ago

pavi24102y ago

I want to learn about agents too!

hcks2y ago

Don’t waste your time, it’s been around since GPT3, and had no results so far. Also notice how no frontier lab is working on it.

zby2y ago

etse2y ago· 5 in thread

My reading of the article is that because LangChain is abstracted poorly, frameworks should not be used, but that seems a bit far.

lolinder2y ago

All of the logic of stringing prompts and outputs together can easily happen in basically any programming language with maybe a tiny bespoke framework customized to your needs.

ilaksh2y ago

They become agents when the LLM output is function calls.

1 more reply

Kostarrr2y ago

Disclamer: I work for Octomind.

autokad2y ago

prompt engineering requires the ability to see what is happening at various steps and langchain makes that harder if not impossible.

honestly I don't need that much abstraction.

int_19h2y ago

LangChain is kinda like taking that state of hardware and bolting on a modern C++ compiler with templates and STL on it.

bastawhiz2y ago· 5 in thread

I really want to at least understand when to use this as a tool but so far I've been failing to figure it out. Some of the things that I tried applying it for:

- Doing a kind of function calling (or at least, implementing the schema validation) for non-gpt models

- parsing out code snippets from responses (and ignoring the rest of the output)

- Having the output of a prompt return as a simple enum without hallucinations

Any resources are appreciated

starik362y ago

It makes it simple (and uniform) to switch providers.

localfirst2y ago

theres already solutions for this but even this i feel like is a wasted effort unless you have the token volume to justify high availability

bastawhiz2y ago

Is that really it?

mikeqq20242y ago

Mind tell what kind of scenario you are tring to solve?

bastawhiz2y ago

elbear2y ago· 3 in thread

It would have been great if the article provided a more realistic example.

The example they use is indeed more complex than the openai equivalent, but LangChain allows you to use several models from several providers.

    prompt | model | parser

bestcoder692y ago

Also the downside of not being able to easily tweak prompts based on experiments (crucial!)

elbear2y ago

Your comment gives better reasons than the article for not using LangChain.

drdaeman2y ago

Yeah, I was kind of surprised. The premise of the article started as "LangChain abstractions are off" and then the complaint was about... just a very simple pipeline?

danielmarkbruce2y ago· 3 in thread

nosefurhairdo2y ago

Then of course there's the many web application frameworks, because nobody in their right mind would want to implement http request parsing themselves (outside of academic exercises).

danielmarkbruce2y ago

Reread the thread and the comment. It's about the LLM frameworks and acknowledges that most non LLM frameworks historically are helpful and correct in abstracting away details.

1 more reply

randomdata2y ago

It often took quite a long time for those historic frameworks to get the abstraction right. Survivorship bias sees us forget all the failed attempts.

StrauXX2y ago· 3 in thread

freezed82y ago

(jerry here from llamaindex)

darkteflon2y ago

Unlike LangChain, which is a nightmare to pull out once you’ve started working with it - LI can be cleanly excised if you change your mind.

sramam2y ago

Have you considered LiteLLM?

elijahbenizzy2y ago· 2 in thread

I really like the idea of "good" and "bad" abstractions. I have absolutely built both.

This sentiment is echoed in this comment in reddit comment as well: https://www.reddit.com/r/LocalLLaMA/comments/1d4p1t6/comment....

laborcontract2y ago

    > the "bad" abstractions make things abstract away tasks that you really need insight into.

elijahbenizzy2y ago

captaincaveman2y ago· 2 in thread

iknownthing2y ago

Looks like they've parlayed it into some kind of business https://www.langchain.com/

bestcoder692y ago

VC-backed, if you couldn’t guess already

dcole29292y ago· 2 in thread

ilaksh2y ago

You don't need an abstraction at all really. Inserting the previous output into the new prompt is one line of code, and calling the API is another line of code.

If you really feel like you need to abstract that then you can make an additional helper function. But often you want to do different things at each stage so that doesn't really help.

riwsky2y ago

djohnston2y ago· 2 in thread

Idk, dude spends the post whining about writing multi agent architecture and doesn’t mention langgraph once. Reads like a lead who failed to read the docs.

2C642y ago

LangGraph is the primary reason I use LangChain - being able to express my flow as a state machine has been a boon to both the design of my platform as well as my own productivity.

esafak2y ago

How does langgraph stack up against the alternatives?

matusp2y ago· 1 in thread

localfirst2y ago

i've never seen a HN thread where everybody just unanimously agrees and wow I definitely will not be recommending Langchain or using it personally after reading through all the horror stories.

seems like another case of creating busysoftware. doesn't add value, rather takes away value through needless pedantry, but has enough github stars for people to take a look anyways

deckar012y ago· 1 in thread

luke-stanley2y ago

From memory, I recall Vulture might do something like that!

zby2y ago· 1 in thread

There are now libraries that cover some of the features of Langchain. There is Instructor and mine LLMEasyTools for function calling, there is LiteLLM for API unification.

mike9862y ago

from the first glance of the example, it seems like the 1st use case is invoking a function (selected by chatgpt) and the 2nd use case is similar to instructor.

can you comment how your library differs from instructor (what yours can do that instructor can't and vice versa?)

thanks

nosefrog2y ago· 1 in thread

Anyone who has read LangChain's code would know better than to depend on it.

whydid2y ago

A heuristic that I use when judging code quality is a search for "datas" or "metadatas".

clarionbell2y ago· 1 in thread

LangChain approach struck me as interesting, but I never really saw much inherent utility in it. For our production code we went with direct use of LLM runtime libraries and it was more than enough.

randomdata2y ago

Of course, once the model is proven it is handed off to developers to build something more production-worthy.

jsemrau2y ago· 1 in thread

LCEL is such a weird paradigm that I never got the hang of. Why | use | pipes?

elbear2y ago

I found it weird as well to see that. I didn't know LangChain overrode Python syntax.

But, if you're familiar with Linux/Unix, this should be familiar. You are piping the output of one function as the input of another function.

isaacphi1y ago

Kydlaw2y ago

IMO LangChain provides very high level abstractions that are very useful for prototyping. It allows you to abstract away components while you dig deeper on some parts that will deliver actual value.

andrewfromx2y ago

"When abstractions do more harm than good" I'll take this for $2000 please and if i get the daily double, bet it all.

iknownthing2y ago

Turskarama2y ago

This is so common I think it could just about be a lemma:

Any tool that that helps you to get up and running quicker by abstracting away boilerplate will eventually get in the way as your projects complexity increases.

fragebogen2y ago

maximilianburke2y ago

I just pulled out LangChain from our AI agents; we now have much smaller docker images and the code is a lot easier to understand.

whitej1252y ago

monarchwadia2y ago

https://monarchwadia.medium.com/use-openai-in-your-javascrip...

czechdeveloper2y ago

I used langchain in one project and I do regret choosing it over just writing everything over direct API. I feel their pain.

cyanydeez2y ago

In some sense, this could be retitled "We no longer use training wheels on our bikes"

codelion2y ago

d4rkp4ttern2y ago

We have companies using Langroid in production.

[1] Langroid: https://github.com/langroid/langroid

wouldbecouldbe2y ago

Everyone in my office is talking about ai agents as a magic bullet, driving me crazy

dmezzetti2y ago

An alternative is using txtai (https://github.com/neuml/txtai). It's lightweight and works with both local and remote LLMs.

Here is an example article that shows how to use OpenAI calls with txtai: https://neuml.hashnode.dev/rag-with-llamacpp-and-external-ap...

bratbag2y ago

I made the same choice for our stack last year.

I can see how switching models might be easier using LangChain as an abstraction layer, but that doesn't justify making everything else harder.

Oras2y ago

The comments are good example that hype > quality.

99% of docs mentioning LangChain or showing a code example with LangChain. Wherever you look at tutorials or YouTube videos, you will see LangChain.

They take the credit of being the first framework to abstract LLM calls and other features such as reading data from multiple sources (before function calling was a thing).

Langchain was first, got popular, and hence for new comers they think it’s the way, until they use it.

jostmey2y ago

Learning LangChain is effort, but not as much as truly understanding deep learning, so you learn LangChain and it feels like progress, when it may not be

gravenate2y ago

Hard Agree, Semantic Kernal, On the other hand seems to actually be a value add on top of the simple API calls. Have you guys tried it ?

andix2y ago

Are there better abstractions? I wanted to look into Microsoft's Semantic Kernel, which seems to be a direct competitor of LangChain. Are there any other options?

https://learn.microsoft.com/en-us/semantic-kernel/overview

__loam2y ago

Langchain has always been open source and has always sucked. I'm shocked anyone still uses it when you can see it for yourself.

resource_waste2y ago

LangChain tutorials be like:

Go to foo_website and put your credit card to get their API. Then go to bar_website, get their API. Then go to yayeee_website and get their API. Then go to...

But unironically.

I actually counted 4 APIs in some 'how to' article. I ended up DIYing that with 0 APIs.

Whoever got into langchain planted their APIs. That is why it sucks.

sabrina_ramonov2y ago

You used langchain for a simple replacement of OpenAI API calls — of course it will increase complexity for no benefit.

The benefits of langchain are: (1) unified abstraction across multiple different models and (2) being able to plug this coherently into one architecture.

If you’re just calling some OpenAI endpoints, then why use it in the first place?

zackproser2y ago

Here's a real world example of a custom RAG pipeline built with Langchain

https://zackproser.com/chat

I did a full tutorial with source code that's linked at the top of that page ^

Fwiw I think it's a good idea to build with and without Langchain for deeper understanding.

sandGorgon2y ago

shameless plug - i build a JS/TS framework which tries to solve the abstraction problem. we use a json variant called jsonnet (created at google. expressive enough for kubernetes).

https://github.com/arakoodev/EdgeChains/tree/ts/JS/edgechain...

examples of these jsonnet for react COT chains - https://github.com/arakoodev/EdgeChains/blob/ts/JS/edgechain...

P.S. we also build a webassembly compiler that compiles this down to wasm and deploy on hardware.

createaccount992y ago

A lot of competition in the field, and just about all of them (llamaindex/autogpt/langchain/others?) appear as "build sdk, build saas on top" type of products.

Curious thing, but I'd rather not partake myself.

greo2y ago

I am not a fan of LangChain. And I would never use it for any of my projects.

te_chris2y ago

The thing that blows my mind is that this wasn’t obvious to them when they first looked at langchain

spullara2y ago

nprateem2y ago

seany622y ago

Glad to see I'm not the only one experiencing this. The agents framework I use is moving very fast and its not uncommon for even minor versions to break my current setup

cyounkins2y ago

Is there a lighter weight solution that abstracts the interfaces so I can swap GPT4 with Claude, including function calling?

mark_l_watson2y ago

For my personal LLM hacking in Python, I am starting down the same path: writing simple vector data stores in NumPy, write my own prompting tools and LLM wrappers, etc.

[1] https://leanpub.com/langchain/read

Havoc2y ago

There was a Reddit thread in langchain sub a while back basically saying exactly this (plus same comments as here)

ZiiS2y ago

The "good abstraction" has a bug; slightly undermines the argument.

_pdp_2y ago

We also built our own system that caters for our customers' needs.

hcks2y ago

LangChain is a critical thinking test and orgs using it are ngmi

JSDevOps2y ago

The dude on that blog is trying way too hard to look like Sam Altman which is fucking weird.

gexaha2y ago

that's a nice AI image with octopi

xyst2y ago

Never been a fan of ORM for databases. So why would that change with AI/LLM “prompt engineering”? Author confirms my point.

ricklamers2y ago

j / k navigate · click thread line to collapse