Opening. This role is to provide complete assessment of models and identify the weakness point revealed in evaluation. The ideal candidate will have experience in model assessment and evaluation task development, including public and in-house benchmarking.
What you'll do
- Provide complete assessment of models.
- Deep dive into model training and data to identify the weakness point revealed in evaluation.
- Communicate with modeling and data team to come up with plans to improve model quality.
What you need
- Model assessment and evaluation task development experience.
- Familiarity with inference frameworks like SGlang and vLLM.