13 Commits

Author SHA1 Message Date
Federico Brancasi c33e376aa0 Allow invoking simulationMain with alternative action input (#4304)
* Allow invoking simulationMain with alternative action input

* Address review comments: rename CLI opts, extract pipeline, fix correctness issues

- Rename CLI options with --train- prefix (--train-input, --train-strategy,
  --train-out, --train-row-offset, --train-worker) and document all options
- Extract runInputPipeline/runInputPipelineParallel to test/pipeline/trainPipeline.ts
- Preserve original row index through parse/replay/prompt pipeline to fix
  sample numbering drift when rows are filtered out
- Fix parseSuggestedEdit: use JSON.parse for escaped text, handle missing delimiter
- Fix line number regex to accept optional space after | (WithoutSpace format)
- Clamp concurrency to >= 1, type samples as ISample[], wrap dispose in try/finally
- Gate verbose logging in loadAndParseInput behind verbose flag
- Use splitLines from existing utility instead of local duplicate

* move nes-datagen to a subcommand

* more code reuse around setting promptStrategy and model config

* Address review: use ResponseFormat, Limiter, assertNever, and raw messages

* minor refactor runPipeline

* finalize

* use POT instead of custom code

* move files from script/ to test/pipeline/

---------

Co-authored-by: ulugbekna <ulugbekna@gmail.com>
2026-03-30 15:44:46 +00:00
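Three of the fixes listed in commit c33e376aa0 above (preserving the original row index, accepting an optional space after `|`, and clamping concurrency to >= 1) can be illustrated with a minimal TypeScript sketch. This is not the code from test/pipeline/trainPipeline.ts: the names `IndexedRow`, `tagWithIndex`, `filterKeepingIndex`, `parseNumberedLine`, and `clampConcurrency` are hypothetical, and the `<number>|<text>` line shape is an assumption based on the commit message's mention of a WithoutSpace format.

```typescript
// Hypothetical shape: a row paired with its position in the original input.
interface IndexedRow<T> {
	originalIndex: number;
	value: T;
}

// Tag every row once, up front, so later filtering cannot shift the numbering.
function tagWithIndex<T>(rows: readonly T[], rowOffset = 0): IndexedRow<T>[] {
	return rows.map((value, i) => ({ originalIndex: i + rowOffset, value }));
}

// Filtering keeps the tag, so surviving rows still report their input position.
function filterKeepingIndex<T>(
	rows: readonly IndexedRow<T>[],
	keep: (value: T) => boolean
): IndexedRow<T>[] {
	return rows.filter(row => keep(row.value));
}

// Assumed line shape: "<number>|<text>" or "<number>| <text>".
// The " ?" makes the single space after the pipe optional.
const numberedLine = /^(\d+)\| ?(.*)$/;

function parseNumberedLine(line: string): { lineNumber: number; text: string } | undefined {
	const match = numberedLine.exec(line);
	if (!match) {
		return undefined;
	}
	const [, lineNumber, text = ''] = match;
	return { lineNumber: Number(lineNumber), text };
}

// NaN, infinities, zero, and negative values all fall back to a single worker.
function clampConcurrency(requested: number): number {
	return Number.isFinite(requested) ? Math.max(1, Math.floor(requested)) : 1;
}
```

With rows `['a', '', 'c']` and a row offset of 10, dropping the empty row leaves original indices 10 and 12 instead of renumbering the survivors to 10 and 11. One trade-off of the optional space in this sketch: a line in the no-space format whose text really begins with a space loses that one leading space.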
Matt Bierner 3c8134184b Enable no-unexternalized-strings in repo (#2448)
Enables the same `no-unexternalized-strings` rule we have in `vscode` in this repo. This makes sure we have a more consistent style across repos and when generating edits.
2025-12-05 18:45:12 +00:00
Shaun Miller 971dc0e40b Modifications to allow direct-endpoint tests to use responses api (#1047)
* add override for responses api for openai compatible endpoints via model config

* removing the definition of the supported_endpoints member: even though it is used, it is not part of the interface, and defining it in the interface causes issues.

* Modifications to allow direct-endpoint tests to use responses api

* add a little more defensiveness to how people encode the configuration file

* remove dead code

* the responses api flag moved, so adjusting the code that sets it

* put back accidentally removed code

* fix type issue
2025-09-25 17:14:39 +00:00
Shaun Miller d1ac17d720 restore correct service creation for list-models (#1090) 2025-09-24 13:21:00 +00:00
Matt Bierner 87410ce2d0 Update embedding names for simulation/test (#642) 2025-08-18 20:03:50 +00:00
SteVen Batten e18cf050d9 Support external cache layers (#572)
* support external cache layers

* cleanup

* address feedback
2025-08-18 14:36:23 +00:00
Vritant Bhardwaj 54c4771847 more model config support for simulations (#360)
* modify body intercepts

* allow disabling temperature

* change config

* allow specifying context window

* some updates

* rearrange and simplify

* little more cleanup

* treat auth as nullable

* treat capabilities as nullable

---------

Co-authored-by: Vritant Bhardwaj <vrtoku@gmail.com>
2025-08-04 13:41:43 +00:00
Vritant Bhardwaj 89bc148ed3 Add azure open ai support to simulator model config (#346)
* add aoai support

* add comment

* support `useDeveloperRole`

* change family
2025-07-24 15:28:12 +00:00
Vritant Bhardwaj f167b31c28 Add ability to specify models through config for simulation tests (#324)
* add ability to specify modelConfig from file

* use custom model in test endpoint provider

* fix model check

* change model config to be independent from specifying model to run

* support yaml

* use readFileSync

* remove yaml parsing

* improve comment

* fix api key

* fix comment

* make opt parsing align with comment
2025-07-23 17:03:54 +00:00
Logan Ramos 27577393b6 Allow skipping /models cache (#336) 2025-07-22 19:43:42 +00:00
Shantnu Suman ab1805d7c0 Print Error literal at the start of all error messages for better parsing from the logs (#260) 2025-07-16 10:51:56 +00:00
João Moreno 0562b55dde propagate test run info to caches (#201)
* propagate test run info to caches

* fix tests
2025-07-11 12:59:21 +00:00
kieferrm 333d9a4053 Hello Copilot
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
2025-06-27 11:35:20 +02:00