Show HN: Structured output from LLMs without reprompting

(automorphic.ai)

174 points · sandkoan · 2y ago · 54 comments

Built a tool for transforming unstructured data into structured outputs using language models (with 100% adherence).

If you're facing problems getting GPT to adhere to a schema (JSON, XML, etc.) or regex, need to bulk process some unstructured data, or generate synthetic data, check it out.

We run our own tuned model (which you can self-host if you want), so we have fine-grained control over text generation.

Repository: https://github.com/automorphic-ai/trex

Playground: https://automorphic.ai/playground

46 comments · 12 top-level

behnamoh · 2y ago · 10 in thread

The longer this goes on, the more I realize that the true power of LLMs is not in the unstructured text they can generate, but in structured output. There are two approaches to achieve this:

1. LMQL/guidance/JSONformer/OP's post

2. finetuning the model to understand function calls and their (potentially) JSON schemas.

There was a comment here about OpenAI's approach (finetuning a model to understand function calls) which raised a good point: since finetuning is often forgetful (knowledge the model learnt previously gets forgotten a little), it's not clear whether OpenAI's approach has made GPT-4 less capable than it was before. Not to mention that you're still dealing with a statistical process (an LLM), not a locked-in algorithm that generates the desired schema 100% of the time.

Which brings me to the other approach: steering the LLM's output __as it is generating tokens__, which is what LMQL does. This results in less token usage (you don't send the function schema as part of your prompt/message to OpenAI) and 100% schema adherence, because token probabilities are modified during decoding (e.g., 0% chance of any character except ":" after the closing double quotation mark of a JSON key).
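To make the "modified token probabilities" idea concrete, here is a minimal sketch of logit masking. It is not LMQL's actual implementation; the token scores and the helper names `mask_logits` and `softmax` are made up for illustration, and a real system would apply the same mask to the model's full vocabulary at every decoding step.

```python
import math

def mask_logits(logits, allowed):
    """Set the score of every token outside `allowed` to -inf."""
    if not any(tok in allowed for tok in logits):
        raise ValueError("no allowed token exists in the vocabulary")
    return {tok: (score if tok in allowed else -math.inf)
            for tok, score in logits.items()}

def softmax(logits):
    """Turn scores into probabilities; masked tokens get exactly 0."""
    top = max(s for s in logits.values() if s != -math.inf)
    exps = {tok: (math.exp(s - top) if s != -math.inf else 0.0)
            for tok, s in logits.items()}
    total = sum(exps.values())
    return {tok: e / total for tok, e in exps.items()}

# Hypothetical raw scores from a model that has just emitted `{"name"`.
# Unconstrained, the model would prefer " is", which is invalid JSON here.
raw_logits = {":": 1.0, ",": 2.5, "}": 0.3, " is": 3.1}

# JSON requires a colon after an object key, so only ":" stays allowed.
probs = softmax(mask_logits(raw_logits, allowed={":"}))
next_token = max(probs, key=probs.get)
```

Because the mask is applied before sampling, an invalid token cannot be chosen no matter how strongly the model prefers it, which is where the adherence guarantee comes from.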

brucethemoose2 · 2y ago

> Which brings me to the other approach: steering the LLM's output __as it is generating tokens__

A relevant PR:

https://github.com/ggerganov/llama.cpp/pull/1773

The plan is to support arbitrary grammar files to constrain token generation, similar to the grammar files here:

https://github.com/antlr/grammars-v4
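As a rough illustration of grammar-constrained generation (not the llama.cpp implementation or its grammar file format), the sketch below drives a toy state machine in which each state lists the tokens the grammar allows next. The grammar, the vocabulary, and `fake_model_scores` are all hypothetical stand-ins; a random scorer plays the role of the LLM.

```python
import json
import random

# Hypothetical toy grammar:  object := "{" key ":" number "}"
# Each state maps to (tokens allowed in that state, state that follows).
GRAMMAR = {
    "start": ({"{"}, "key"),
    "key": ({'"a"', '"b"'}, "colon"),
    "colon": ({":"}, "value"),
    "value": ({"0", "1", "42"}, "close"),
    "close": ({"}"}, "done"),
}
VOCAB = ["{", "}", ":", '"a"', '"b"', "0", "1", "42", "hello", ","]

def fake_model_scores(rng):
    """Stand-in for a real LLM: a random score for every vocabulary token."""
    return {tok: rng.random() for tok in VOCAB}

def generate(seed):
    """Generate one string, letting only grammar-legal tokens compete."""
    rng = random.Random(seed)
    state, out = "start", []
    while state != "done":
        allowed, next_state = GRAMMAR[state]
        scores = fake_model_scores(rng)
        # Tokens such as "hello" or "," are never considered, whatever they score.
        token = max(allowed, key=lambda t: scores[t])
        out.append(token)
        state = next_state
    return "".join(out)

# Every generated string parses as JSON, whatever the "model" prefers.
samples = [json.loads(generate(seed)) for seed in range(20)]
```

A real grammar needs a parser that tracks nesting and recursion rather than a flat state table, but the per-step filtering of candidate tokens is the same idea.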

kelseyfrog · 2y ago

I mean even Jsonformer used LogitsWarper when generating numbers, but yes arbitrary grammars are infinitely more powerful.

lbeurerkellner · 2y ago

Thank you for bringing up LMQL. We have active branches with regex, parsers and types (structured and other types) which will also soon be upstreamed, improving typed LLM use beyond the current template-based approach we support.

sebmellen · 2y ago

Yes! I don’t have much to say that wouldn’t be restating what I’ve already written, so I’ll link this for reference: https://www.sebastianmellen.com/post/2023/the-killer-use-cas....

darkteflon · 2y ago

I'd never heard of LMQL before today, but it looks very nice. Do you have any experience building with it and, if so, would you be willing to comment on what it's like?

theblazehen · 2y ago

I did a POC project with it recently. The guidance on gpt-3.5-turbo and gpt-4 models isn't as functional as plain gpt-3. I found I had better results using https://github.com/piercefreeman/gpt-json and it doesn't require multiple calls to the API. Not as feature-rich, but it may meet your needs.

darkteflon · 2y ago

Thanks for the recommendation - gpt-json looks quite nice, actually - will check it out.

sandGorgon · 2y ago

Has there been any work done to finetune OSS models to behave the same as OpenAI functions (to constrain JSON output)?

This is table stakes now, but it doesn't seem that ANY open-source model has this capability.