Who is building LLM Chatbots, and what issues are you running into?

16 pointspetervandijck2y ago11 comments

Heya, like probably everyone, we are building some internals LLM chatbots for customers of ours. I'd love to hear hands-on insights for what people are doing, why, what's working for them/not working, etc.

11 comments

7 comments · 2 top-level

BoorishBears2y ago· 3 in thread

Chain of thought is underutilized. It almost never makes sense to show the user the "bare" response of the LLM. It's so easy to have LLMs self-critique, think through user intent, etc. to drastically improve the final output

petervandijckOP2y ago

Improve probably, but speed also matter, I worry a lot about chaining lots of LLM calls together, each of them takes a bunch of seconds, and then the experience becomes just really slow.

BoorishBears2y ago

You don't make multiple calls, you ask for a structured output and use some of the keys as the "chain links" in the chain of thought

1 more reply

quickthrower22y ago

Yeah you better be 99.9% perfect if you are going to take 60 seconds of plain spinner time to come back to me.

1 more reply

petervandijckOP2y ago· 2 in thread

For example, for us, we are building an LLM chatbot that pulls in the data of a technical book publisher. They have 20 years of technical books, and 20 years of videotaped conference talks.

Hard:

- We're using LangChain, which isn't always great

- The data pipeline was trickier than I had initially thought

- Indexing embeddings (in PostGres) is just hard (requires tons of ram)

But the hardest thing has been working on conversation quality. We've started to use LangSmith, which was a godsend for tracing and observability, and came out fairly recently. But it's not perfect and I wish there were better tools out there.

jondwillis2y ago

What do you find lacking in LangSmith?

I have been using it since the week it was in private beta, albeit a lot less recently, and thought it was good, though with some confusing UX and a handful of bugs.

petervandijckOP2y ago

Just generally a lot of abstractions that seem sometimes overwrought, and seem to hide details.

1 more reply

j / k navigate · click thread line to collapse