Data Engineer

(55 days ago)

Alibaba Travels

Tehran/ Kooye Bimeh

Full Time

Working days and hours

شنبه تاچهارشنبه

Business trips

Facilities and Benefits

درباره شرکت

Company Size

501 - 1000 employees

Industry

Internet Provider / E-commerce / Online Services

Company Type

Iranian company dealing with Iranian and foreign customers

Establishment year

1393

Ownership type

Privately held

Company score

3.8

Products or Services

Travel, Hotel, Tourism & Tour

About Company

Alibaba Group is an Iranian technology-oriented and leading tourism holding company with 11 brands and more than 500 staff. The vision of the Alibaba Group is to generate wealth through tourism by pivoting technology and transforming organizational culture in Iran. The headquarter is located at Azadi Innovation Factory; the building has a highly modern and dynamic environment as well as a very distinguished architecture. Growth and transformation are easy to explore Alibaba Group; such a context is inherited from confronting real challenges. Staff at Alibaba Group can access generous resources in order to explore their potentials. Alibaba Group is the land of opportunities. We enjoy friendly interactions while making all the decisions collectively. A league of champions and professionals are accompanying Alibaba Group who are bold and ambitious enough to create anything they want. Alibaba Group stands for the future while the sky is the limit. It is our duty to transform the quality of people’s lives: colleagues, clients, and fellow Iranian citizens. Alibaba Group’s 11 brands are as follows: • Alibaba • Jabama • Toosha • Brandist • Altrabo • Simorgh • Medgo • Neshanet • Sindbad • Nabro • Dobby Through technological development, we in alibabagroup stand by creating wealth, society and transformation of working culture in Iran. We do not wait for the future, we create it ourselves. We have friendly relationships with our awesome colleagues and we experience a mode of being and performance that empowers us. This is the land of opportunities; in alibaba, we create a context for exploration, transformation, and growth of our colleagues.

Company benefits

Loan

Military Service Option

Health insurance

Recreational accommodation

Flexible working hours

Learning stipends

Game room

Lunch

Snacks

Gym facilities

Resting space

Recreational and tourism facilities

Breakfast

Library

Occasional packages and gifts

مشاهده سایر موقعیت های شغلی این سازمان

توضیحات بیشتر

key Requirements

6 years experience in similar position

Job Description

We are looking for a skilled Data Engineer to join our team (one of Alibaba group business) and play a key role in building the data infrastructure for our AI-powered Legal Assistant. The ideal candidate will have strong expertise in data pipelines, preprocessing, and preparing high-quality training datasets for language models (LLMs).

You will work closely with AI engineers, MLOps, and product teams to ensure that our legal AI models are trained on clean, structured, and reliable data.

Responsibilities:

Design, build, and maintain scalable ETL/ELT pipelines for legal and judicial data.
Collect, clean, normalize, and preprocess large volumes of unstructured text data.
Prepare and manage training datasets for NLP and LLM models.
Collaborate with AI Engineers to fine-tune models using annotated datasets.
Implement automated data quality checks and validation processes.
Manage databases, data storage, and optimize data access for ML pipelines.
Ensure compliance with data privacy and security standards.

Requirements:

Strong programming skills in Python (Pandas, PySpark, etc.).
Experience with data pipeline frameworks (Airflow, Luigi, Prefect).
Hands-on experience with databases (SQL/NoSQL, PostgreSQL, MongoDB, ElasticSearch).
Familiarity with Spark NLP or other large-scale NLP frameworks.
Solid understanding of text data preprocessing (tokenization, normalization, cleaning).
Familiarity with NLP datasets and annotation workflows.
Knowledge of data security and handling sensitive information.

Nice to Have:

Experience with legal or domain-specific datasets.
Familiarity with Persian (Farsi) NLP models (e.g., ParsBERT, HooshvareLab BERT, mBERT, XLM-R) for NER tasks.
Understanding of MLOps workflows and integration with AI models.

Benefits:

Be part of an innovative and impactful project in the legal-tech domain.
Work in a collaborative and fast-learning environment.
Opportunity to grow professionally with exposure to cutting-edge AI technologies.

Job Requirements

Gender

Men / Women

ثبت مشکل و تخلف آگهی

ارسال رزومه برای شرکت سفرهای علی‌بابا

سوابق ارسال رزومه برای این شرکت