Conversation
…stinguish-api-keys Distinguish the models used in the executor and evaluator
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
…s/sysmobench/sysmobench_core'
- Add gpt-4o model configuration to models.yaml - Fix setup_tools.py to use shutil.move instead of os.rename This resolves 'Invalid cross-device link' error when /tmp is on different filesystem
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
* modified the regular expression match * added cs107_fall_2023_final * updated one wrong tag * Added pdf of the paper * solved all the comments
* Add UIUC CS423 Fall 2025 Exams * Fix json parsing * Fix backtick * Fix backtick 2 * Update benchmarks/courseexam_bench/data/cs_423_operating_systems_design_fall_2025_midterm/MP1.md Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update benchmarks/courseexam_bench/data/cs_423_operating_systems_design_fall_2025_final/exam.md Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Xuan Feng <xfeng9209@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
…ce#54) * add ostep processes-shell lab * fix * Update benchmarks/courselab_bench/data/cs537-projects-spring-2019/processes_shell/task.md Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Tarek Elsayed <60650661+tareknaser@users.noreply.github.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
* added cmu15-213 data lab * docs(courselab): add note about infrastructure restrictions Signed-off-by: Tarek <tareknaser360@gmail.com> --------- Signed-off-by: Tarek <tareknaser360@gmail.com> Co-authored-by: Tarek <tareknaser360@gmail.com>
* add cs537 fall 2021 final exam * add institution * fix * add solutions * update metadata * add choice array * avoid extra restrictions on LLM output Signed-off-by: Tarek <tareknaser360@gmail.com> --------- Signed-off-by: Tarek <tareknaser360@gmail.com> Co-authored-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
777lefty
left a comment
There was a problem hiding this comment.
Some missing issues and format issues
|
|
||
| ## Question 2 [3 point(s)] | ||
|
|
||
| Explain why the following implementation of semaphore P is incorrect. Provide an example interaction between two threads that illustrates the problem. |
There was a problem hiding this comment.
The code snippet for this question seems to be totally missed
| n /= d; | ||
| printf("%d\n", d); | ||
| return 0; | ||
| } |
There was a problem hiding this comment.
Better use table and code block to wrap them
|
|
||
| ## Question 7 [2 point(s)] | ||
|
|
||
| What concurrency problem does this program suffer from? |
There was a problem hiding this comment.
Code snippet provided in the background for all the subproblems of the problem is missed
| ``` | ||
|
|
||
| --- | ||
|
|
There was a problem hiding this comment.
Question 3 is skipped, but it seems to be feasible because it doesn't contain figure, and we can ask the agents to output table to represent stack
|
|
||
| --- | ||
|
|
||
| ## Question 10 [2 point(s)] |
There was a problem hiding this comment.
Background information for all subproblems in problem 4 is missed
| copy trapframe to stack | ||
| modify trapframe | ||
| enter usermode | ||
| } |
| ``` | ||
|
|
||
| --- | ||
|
|
There was a problem hiding this comment.
Question 5 is skipped, seems to be feasible as well
|
|
||
| --- | ||
|
|
||
| ## Question 15 [3 point(s)] |
There was a problem hiding this comment.
Background information for Question 6 is missed
67457ba to
0fc8bab
Compare
0fc8bab to
77bc8c0
Compare
Description
Add the CS350 Fall 2018 Midterm exam dataset and associated solution file to the course exam benchmark.
Changes
Testing
Prepared the dataset and verified processing by running: python3 courseexam/prepare.py
Checklist