Leonardo Schettini, Developer in Vienna, Austria

Leonardo Schettini

Verified Expert in Engineering

Artificial Intelligence (AI) Developer

Location
Vienna, Austria
Toptal Member Since
February 22, 2022

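Leonardo is a skilled data scientist and software engineer specializing in natural language processing. He has led projects for startups and companies with teams of various sizes. His projects are usually written in Python and deployed on Kubernetes and Docker environments. Leonardo stays calm under pressure and is quick to learn and apply new skills.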

Portfolio

Crayon
Agile Software Development, Artificial Intelligence (AI), BERT...
TheVentury
Python, PyTorch, Scikit-learn, SpaCy, Pandas, NumPy, Matplotlib, Time Series...
Instituto Avançado de Tecnologia e Inovação (IATI)
Python, Research, Time Series, Tableau, Oracle, Redmine, Data Processing...

Experience

Availability

Part-time

Preferred Environment

MacOS, Docker, Agile Software Development, Visual Studio Code (VS Code), Python

The most amazing...

...project I've worked on is a set of machine learning tools that make text and audio accessible to people with hearing impairments.

Work Experience

NLP Data Scientist

2022 - PRESENT
Crayon
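  • Led the development of end-to-end NLP systems for clients worldwide.
  • Leveraged GPT models from OpenAI, Azure services, LangChain, and internal tools to build a framework for "ask your data" solutions, allowing features to be delivered to clients in 50% of the original time.
  • Researched and experimented with evaluation techniques for the different stages of "ask your data" solutions.
  • Fostered cross-team collaboration by identifying reusable components and helping to modularize and abstract them.
  • Improved and maintained an internal tool that automates the MLOps lifecycle, using tools like GitHub Actions, Terraform, Docker, and Azure Machine Learning.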
Technologies: Agile Software Development, Artificial Intelligence (AI), BERT, Machine Learning, Azure Cognitive Services, Azure ML Studio, Azure SQL Databases, Azure Functions, Terraform, Python, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), GPT, Named-entity Recognition (NER), Information Extraction, Clustering, Data Processing, Text Processing, Sentiment Analysis, Text Classification, Docker, PyTorch, Pandas, NumPy, GitHub Actions, Azure DevOps, Jira, SQL, Git

Software Engineer | Data Scientist | DevOps Engineer

2019 - PRESENT
TheVentury
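  • Served as lead developer on multiple projects in a team of five, including a core system for a corporate client.
  • Developed end-to-end machine learning systems that make the internet more accessible to people with hearing impairments, leveraging semantic similarity and time series forecasting techniques.
  • Developed topic modeling and text generation features for a machine learning product that helps video content creators understand their competition and structure their videos.
  • Worked with early-stage startups, consulting on and developing NLP applications and recommendation systems for recruitment, housing recommendations, and legal documents.
  • Reduced the time needed to deploy staging environments for projects by 50% while guaranteeing reproducibility, enabling effortless deployments for feature branches and easy management of the deployments.
  • Automated several Jira workflows to reduce management overhead.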
Technologies: Python, PyTorch, Scikit-learn, SpaCy, Pandas, NumPy, Matplotlib, Time Series, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Chatbots, BERT, Kubernetes, Rancher, Docker, Docker Compose, Azure Virtual Machines, GitLab CI/CD, GitHub Actions, Jira, Jira REST API, Machine Learning, DevOps, Node.js, JavaScript, Natural Language Understanding (NLU), Information Extraction, Data Science, Supervised Learning, MySQL, PostgreSQL, REST, Flask, Microsoft SQL Server, Natural Language Toolkit (NLTK), Recommendation Systems, Data Processing, Jupyter Notebook, Research, Information Retrieval, Redis, Agile Software Development, Visual Studio Code (VS Code), MacOS, SQL, Git, Neural Networks, Artificial Intelligence (AI)

Graduate Research Assistant

2018 - 2020
Instituto Avançado de Tecnologia e Inovação (IATI)
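  • Conducted a research project in conjunction with CPFL, the second-largest non-state-owned group of electricity generation and distribution companies in Brazil.
  • Performed descriptive analysis of time series data collected by smart meters.
  • Deployed and maintained the applications required for the research, such as Oracle, Redmine, and Tableau.
Technologies: Python, Research, Time Series, Tableau, Oracle, Redmine, Data Processing, Machine Learning, Linux, SQL, Git, Artificial Intelligence (AI)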

Artificial Intelligence Researcher | Data Scientist

2018 - 2019
Recrut.ai
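  • Developed a scalable, stateless Python screening tool that reduced recruitment costs by up to 85% for Unilever and Neurotech.
  • Implemented in-house solutions for tokenization, lemmatization, and named-entity recognition on multilingual and unstructured text.
  • Implemented an automatic ranking algorithm based on candidates' résumés.
  • Deployed and maintained application servers in an AWS Elastic Beanstalk worker environment.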
Technologies: Python, Scikit-learn, Natural Language Toolkit (NLTK), Matplotlib, Pandas, NumPy, Supervised Learning, Recommendation Systems, Information Extraction, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), Natural Language Understanding (NLU), PostgreSQL, REST, Flask, Amazon Simple Queue Service (SQS), AWS Elastic Beanstalk, Data Processing, Jupyter Notebook, Data Science, Research, Information Retrieval, Machine Learning, Agile Software Development, Visual Studio Code (VS Code), Linux, Windows, SQL, Git, Neural Networks, Artificial Intelligence (AI), Clustering, Named-entity Recognition (NER), Text Processing, Sentiment Analysis, Text Classification

Chatbot Developer

2017 - 2018
Elife
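  • Developed Facebook Messenger bots for companies such as Mastercard, Yamaha, Super Bock, and Opel.
  • Maintained and improved an in-house chatbot framework based on a naive Bayes classifier, capable of finding a user's intent, keeping conversational context, and analyzing a user's sentiment.
  • Developed in-house software capable of analyzing Instagram images, searching for information that the marketing team could use to improve its consulting techniques.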
  • Integrated applications with third-party APIs, such as Google Cloud Vision, Google Maps, and client-specific APIs.
Technologies: Node.js, JavaScript, Google Vision API, Databases, REST, Heroku, Data Processing, Agile Software Development, Windows, SQL, Git, Artificial Intelligence (AI)

Recrut.ai

http://recrut.ai/
A Python-based application for screening candidates. As the only developer working on the back end of the tool, I implemented algorithms for text processing, information extraction, and candidate ranking. I also defined the software architecture, ensuring that the application was scalable. The tool reduced recruitment costs for Unilever and Neurotech by 85%.

Ask Your Data Framework

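A framework for accelerating the development and deployment of Ask Your Data solutions using OpenAI, the Azure stack, LangChain, and other related tools. I introduced an evaluation strategy to the framework to quantify improvements made to prompts and conversational workflows. I also generalized the data ingestion pipeline and conversational workflows, allowing the framework to be used for multiple projects and reducing the time needed to complete them by up to 50%.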

TheVentury

http://www.theventury.com
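A Python-based tool for generating motion sequences between sign language signs for 3D characters. As a data scientist and lead developer, I integrated the software with internal tools to automate the animators' workflow. I also implemented a heuristic that splits motion sequences into signs and transitions based on the speed of the hands and the model, and used algorithms to predict and smooth the motion of the generated sequences.

I developed another Python tool for linking words to their definitions based on the context in which they are used. For this project, I had to train a model capable of handling domain-specific words in English and German.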

Languages

Python, SQL, JavaScript

Libraries/APIs

Scikit-learn, Pandas, NumPy, Matplotlib, Natural Language Toolkit (NLTK), PyTorch, SpaCy, Jira REST API, Node.js, Google Vision API, Azure Cognitive Services

Tools

Docker Compose, GitLab CI/CD, Jira, Git, Named-entity Recognition (NER), Tableau, Redmine, Amazon Simple Queue Service (SQS), Jupyter, Azure ML Studio, Terraform

Paradigms

Data Science, REST, Agile Software Development, DevOps, Scrum, Azure DevOps

Platforms

Rancher, Docker, MacOS, Visual Studio Code (VS Code), Kubernetes, Jupyter Notebook, Linux, Windows, Oracle, AWS Elastic Beanstalk, Heroku, Azure Functions, Azure

Other

Natural Language Processing (NLP), Machine Learning, Natural Language Understanding (NLU), Supervised Learning, Artificial Intelligence (AI), GPT, Generative Pre-trained Transformers (GPT), Time Series, Chatbots, BERT, Information Extraction, Recommendation Systems, Data Processing, Information Retrieval, Neural Networks, Clustering, Text Processing, Sentiment Analysis, Text Classification, Azure Virtual Machines, GitHub Actions, Research, Cryptography, Streaming Data, Software Engineering, Algorithms, OpenAI GPT-3 API, LangChain

Storage

MySQL, PostgreSQL, Databases, Redis, Microsoft SQL Server, Azure SQL Databases

Frameworks

Flask

2013 - 2018

Bachelor's Degree in Computer Science

Federal University of Pernambuco - Recife, Pernambuco, Brazil

2016 - 2017

Bachelor's Degree in Computer Science

University of Vienna - Vienna, Austria

JUNE 2016 - PRESENT

Scrum Weekend

Trampolim Academy

JULY 2014 - PRESENT

JavaScript

Centro Integrado de Tecnologia da Informação (CITI)
