Nice job, Lorenzo. Thanks a lot!
Would it also work flushing the LLM response stream to another http response where the request is initiated by a browser (e.g. via REST/CSP)?
Played around a bit and I think CSP Gateway is buffering until the call is finished. So interim writes and flushes won't get through even if browser is capable of sse handling. Setting Content-Type to 'text/event-stream' and X-Accel-Buffering to 'no' in the response header won't help.
Any ideas on that?
- Log in to post comments