What You Will Be Doing:
- Model Development and Tuning: Creation and fine-tuning of transformer models (e.g., DeBERTa-v3-large, GPT, or their analogs) for high-precision identification of AI-generated texts.
- Data Management: Building and processing scalable, diverse, and high-quality datasets, including human texts (e.g., The Pile) and AI-generated texts.
- Data Synthesis: Generation of balanced datasets using modern language models (e.g., GPT-4, LLaMA3, Starling-LM:7B-beta).
- Scalability of Solutions: Development of distributed systems for scalable training, testing, and deployment of models.
- Performance Optimization: Improving model accuracy, robustness, and scalability, including handling edge cases and complex examples.
- Model Validation: Conducting rigorous testing using balanced datasets to ensure the objectivity and reliability of models.
Requirements:
- Experience with Transformer Models: Deep understanding of architectures such as BERT, DeBERTa, GPT, or their analogs, including expertise in fine-tuning and modification.
- NLP Expertise: Excellent knowledge of natural language processing and tasks related to distinguishing AI-generated texts.
- Programming: Proficient in Python and frameworks such as PyTorch, TensorFlow, and Hugging Face Transformers.
- Data Handling Skills: Experience in processing large volumes of data and creating clean, balanced training datasets.
- Working with LLMs: Practical experience with modern large language models (LLMs) and their fine-tuning.
- Distributed Systems: Knowledge of distributed solutions for scalable computations and efficient use of computational resources.
- Analytical Thinking: Ability to analyze complex tasks and develop optimal solutions for text detection.
Bonus Points:
- Experience with Tools for AI Model Evaluation.
- Knowledge of Adversarial Testing Methods and their application to improve model reliability.
- Familiarity with Collaborative Development Tools (e.g., GitHub, CI/CD).
- Interest in Ethical and Responsible AI Use.
What We Offer:
- Participation in the development of a fast-growing product operating in real-time markets.
- A competitive salary based on your qualifications and interview performance, ranging from $8,000 to $15,000.
- Opportunities to enhance your expertise by working with top-tier colleagues and learning on the job.
- A dynamic and supportive team of professionals who value integrity, honesty, and openness.
- English classes with a native speaker, health insurance after the probation period, and thoughtful holiday gifts.
- The chance to implement bold and ambitious initiatives.
- A horizontal organizational structure with no bureaucracy or "big boss" syndrome.
- A results-driven work culture with a flexible schedule and fully remote opportunities.
If this sounds like you, apply now to join our team!