Skip to content
View alaeddine-13's full-sized avatar

Organizations

@docarray

Block or report alaeddine-13

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
alaeddine-13/README.md

Alaeddine Abdessalem

LinkedIn GitHub Email

πŸ‘¨β€πŸ’» About Me

Machine Learning Engineer based in Berlin, Germany with expertise in LLMs, MLOps, and software engineering. Experienced in building and fine-tuning large language models, creating embedding models, and developing cloud-native ML platforms. ICPC World Finalist and passionate problem solver with a strong background in algorithmic thinking.

πŸ› οΈ Skills

Machine Learning & AI

  • LLM Engineering: Fine-tuning, evaluation, embedding models
  • Frameworks: PyTorch, TensorFlow, Transformers, TRL
  • ML Tools: DVC, Argilla, MT-Bench
  • Vector DBs: Qdrant, Weaviate

Software Engineering

  • Languages: Python, Java, C/C++, JavaScript, Golang, TypeScript, R
  • Backend: Django, Flask, FastAPI, NodeJS, Express
  • Cloud: AWS, GCP
  • DevOps: Docker, Kubernetes, Terraform, CI/CD (GitHub Actions, GitLab CI)
  • Databases: PostgreSQL, MySQL, MongoDB

πŸ’Ό Experience

Machine Learning Engineer at Jina AI (Jul 2022 - Dec 2023)

  • Contributed to training jina-embeddings-v3 with 8k sequence length and multi-lingual capabilities (2M+ downloads/month on HuggingFace)
  • Enhanced Starcoder language model through distillation and synthetic data training, achieving up to 27% pass@1 on HumanEval
  • Built Jina AI Cloud, a cloud-native MLOps platform for multimodal AI and neural search
  • Maintained open source MLOps framework (Jina) and vector DB library (DocArray)

Software Engineer Intern at Jina AI (Jul 2021 - Jun 2022)

  • Contributed to the release of the Jina 3.0 cloud-native MLOps framework
  • Added search engine components based on AI models to Jina hub
  • Assisted in building Jina AI Cloud with AI and search services

Lead Backend Engineer at Alternative Data Group (Aug 2020 - May 2021)

  • Improved entity recognition modules, ML models, and web crawlers
  • Managed AWS infrastructure for data processing services
  • Led team to achieve 97% overall product accuracy

πŸš€ Featured Projects

AI-powered Terraform generator that creates cloud resources using natural language.

InfraBot Demo

Features:

  • Natural language-based resource creation
  • Support for AWS cloud resources (S3, EC2, etc.)
  • Local infrastructure development with LocalStack
  • Component-based infrastructure management
  • Self-healing infrastructure creation

Tech Stack: Python, Terraform, AWS, LocalStack, OpenAI

AI-powered website creator using LLMs and VLMs to create, render and edit websites.

Demo Video

Previous Version (2021):

  • E-commerce and portfolio website builder
  • Backend: NodeJS, Express, MongoDB
  • Infrastructure: AWS ECS, S3, Route53
  • CI/CD with GitHub Actions and Terraform

Medicat (2021)

Web application using AI models to classify medical documents into ICD10 categories.

Tech Stack:

  • Pre-trained BERT model for document encoding
  • Classification model built with PyTorch
  • Backend: Python and Django
  • Infrastructure: AWS ElasticBeanstalk with CI/CD

πŸ† Achievements

  • ICPC World Finalist (2021) - First Tunisian team to qualify for this prestigious programming competition
  • First Prize OpenGovDataHack (2020) - MaintainIt: dashboard for predictive maintenance to reduce flight delays
  • Maghreb Champions, ACPC (2019) - Arab and Africa Collegiate Programming Championship
  • AWS Developer Associate Certification (Feb 2020 - Feb 2022)

πŸŽ“ Education

Software Engineering Degree
National Institute of Applied Science and Technology (INSAT)
Sep 2017 - Jun 2022, Tunis, Tunisia

🌐 Languages

  • English: Full Professional Proficiency
  • French: Full Professional Proficiency
  • Arabic: Native or Bilingual Proficiency

Open to collaboration and new opportunities in AI and ML engineering.

Pinned Loading

  1. MedicalExtraction MedicalExtraction Public

    Forked from YassineGabsi/MedicalExtraction

    Python 1

  2. Tunisair-Django Tunisair-Django Public

    Django version of the Tunisair REST API that won us the OpenGovDataHack2020

    Python

  3. NetworkAutomationPlatform NetworkAutomationPlatform Public

    Platform that manages network devices, collects data, executes commands and executes daily traffic tests

    Python 1

  4. graphsearch graphsearch Public

    Implementations of graph search algorithms

    Python 1 2

  5. ecs-fargate-s3-infrastructure ecs-fargate-s3-infrastructure Public

    Infrastructure to deploy a containerized web application using ECS Fargate with service autoscaling and an Application Load Balancer. Frontend deployment uses S3 static web hosting setup with a clo…

    HCL 1

  6. Quadtree implementation that was use... Quadtree implementation that was used to optimize crashy.io. Quadtree is used to optimize collision detection and reduce the time complexity for checks from O(nΒ²) to O(n log (n))
    1
    //******************* QuadTree Implementation *******************
    2
    var getX = function(item){
    3
    	return item.x - item.width/2;
    4
    }
    5
    var getY = function(item){