Closed jwyang-google closed 3 months ago
Modify Jetstream to make prefill return first token. Pending testing with MLPerf loadgen.
Modify Jetstream to make prefill return first token. Pending testing with MLPerf loadgen.