Job Description
We are looking for an AIGC Image Generation Engineer Intern to support the development of image generation and editing technologies based on diffusion models, flow matching, and related generative AI methods. This position is open to current students.
Responsibilities
• Develop and optimize image generation pipelines for text-to-image, image-to-image, image editing, inpainting, and style transfer tasks.
• Research, reproduce, and improve generative models based on diffusion models, flow matching, rectified flow, or other image generation frameworks.
• Fine-tune and adapt image generation models for specific styles, characters, products, or application scenarios.
• Work with techniques such as LoRA, ControlNet, IP-Adapter, DreamBooth, prompt engineering, and reference-based generation.
• Build and maintain training, inference, and evaluation pipelines using PyTorch and related tools.
• Improve generation quality, controllability, inference speed, and GPU memory efficiency.
• Prepare, clean, and organize image-text datasets for model training and evaluation.
• Analyze experiment results and write clear technical documentation.
Requirements
• Currently enrolled as a Bachelor’s, Master’s, or Ph.D. student in Computer Science, Artificial Intelligence, Computer Vision, Electrical Engineering, or a related field.
• Strong programming skills in Python and hands-on experience with PyTorch.
• Solid understanding of deep learning, computer vision, and generative models.
• Familiarity with diffusion-based image generation models such as Stable Diffusion, SDXL, Flux, or DiT.
• Experience with at least one of the following is preferred: LoRA, ControlNet, IP-Adapter, DreamBooth, Hugging Face Diffusers, ComfyUI, or Automatic1111.
• Ability to read and reproduce research papers in generative AI or computer vision.
• Experience with model fine-tuning, dataset preparation, or inference optimization is a plus.
• Familiarity with Linux, Git, Docker, CUDA, TensorRT, or ONNX is a plus.
• Strong self-learning ability and interest in building practical AIGC applications.
• Good communication skills, including in English.
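The Shanghai Lab of Sony China Research Laboratory focuses on human-computer interaction (HCI) as its main research direction. It is dedicated to developing new entertainment experience technologies that combine haptics and artificial intelligence (AI), and to applying them in the field of Location-Based Entertainment (LBE).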
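Our team is young, passionate, and creative. We never set limits on ourselves, and we pursue technology with intense and persistent dedication. We are looking for like-minded young people to experience Sony's latest technology through our projects, push past their own limits, and soar freely. Join us, and use your passion and creativity to drive technology and industry forward.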