Amanbir Singh,印度德里的开发者
Amanbir is available for hire
Hire Amanbir

Amanbir Singh

Verified Expert  in Engineering

数据科学家和后端开发人员

Location
Delhi, India
Toptal Member Since
September 13, 2021

Amanbir在数据科学、分析和后端工程方面拥有10年的经验. 他曾在一家大型多边组织和早期科技创业公司工作. Amanbir擅长与客户合作解决复杂的商业问题,并在机器学习方面拥有深厚的专业知识, data analysis, 构建可扩展的web应用程序.

Portfolio

ATS Software
人工智能,机器学习,Python, MySQL, GPT...
Monsoon CreditTech
Python, Pandas, Django, Angular, Docker, Kubernetes,机器学习...
IISD实验湖区公司-主要
机器学习,数据科学,Python, PostgreSQL,亚马逊网络服务(AWS)...

Experience

Availability

Part-time

首选的环境

Python, Data Analytics, Data Science, Machine Learning, Pandas, 生成预训练变压器(GPT), OpenAI GPT-3 API, 最小可行产品(MVP), 生成预训练变压器3 (GPT-3), OpenAI GPT-4 API, User Interface (UI), Product Management, 大型语言模型(llm)

The most amazing...

...我参与的数据科学项目是从头开始构建一个用于信用风险评估的自动机器学习平台.

Work Experience

ML Developer

2023 - PRESENT
ATS Software
  • 致力于计算机视觉模型,从非结构化PDF文件(包括图纸)中提取信息, tables, etc.).
  • 对NER模型进行操作,从自然语言和非结构化文本中提取信息.
  • 采用GPT-4对AI流水线进行后处理,提高性能. 还包括基于规则的后处理,以提高管道性能.
  • 将整个平台部署在AWS SageMaker上,并与客户端的堆栈集成.
  • 训练多模态模型以提高NER性能.
Technologies: 人工智能,机器学习,Python, MySQL, GPT, Amazon SageMaker, Computer Vision, 命名实体识别(NER), Object Detection, Text Detection, Generative AI, Supervised Learning

产品和工程主管

2016 - PRESENT
Monsoon CreditTech
  • Led the development of the SaaS AutoML platform as an architect and product manager; made wireframes, 编写用户和功能需求, 决定后端架构, 用Django跑短跑, Angular, Jenkins, and Docker.
  • 内部使用的结构化AutoML库. 该平台生成了针对借贷进行优化的机器学习模型.
  • 担任我们内部数据科学团队使用的开发人员工具的产品经理和架构师,以加快模型开发和部署.
  • Managed client engagements with 15 banks and NBFCs; built and deployed models to identify risky borrowers at the time of application. 为客户增加了20%以上的收入.
  • 雇佣并管理了一个由10多名数据科学家和软件开发人员组成的团队. 进行一对一的指导,为团队设定目标,并指导初级成员.
  • 为支持多个和多阶段模型的机器学习模型构建了自动部署流程.
Technologies: Python, Pandas, Django, Angular, Docker, Kubernetes,机器学习, Data Science, 机器学习操作(MLOps), XGBoost, Jupyter Notebook, SQL, Data Analytics, Data Visualization, Data Mining, Web Scraping, Data Reporting, 人工智能(AI), Agile, Data Analysis, Time Series, Time Series Analysis, Optimization, Financial Modeling, 亚马逊网络服务(AWS), MySQL, Azure, Scikit-learn, Statistics, Statistical Analysis, Real-time Data, Predictive Analytics, APIs, Banking & Finance, Architecture, Leadership, Automation Scripting, Scripting, AWS Lambda, REST APIs, Amazon S3 (AWS S3), HTML, Decision Trees, Data Scientist, 自然语言处理(NLP), 推荐系统, Regression, PDF Scraping, Scraping, Back-end, 软件架构, Azure ML Studio, Git, Amazon DynamoDB, PostgreSQL, 不良贷款(NPL), Data Scraping, TypeScript, NumPy, MongoDB, Serverless, Predictive Modeling, 客户细分, Visualization, Django REST框架, 完整的开发, API Integration, AI Design, Automation, Full-stack, CSS, Flask, 解决方案架构, Software Development, PyPDF2, openpyxl, Microservices, Advisory, Technology Strategy & Architecture, Databases, Web Development, CTO, DevOps, 谷歌云平台(GCP), JavaScript, 对象关系映射(ORM), Technical Leadership, Database Architecture, 敏捷软件开发, Data Structures, Amazon SageMaker, ETL, 最小可行产品(MVP), 需求分析, Startups, Mathematics, Task Scheduling, Regular Expressions, Sockets, Linear Regression, 数据驱动的决策, Decision Modeling, Neural Networks, Programming, Integration, User Interface (UI), Cloud, Models, 探索性数据分析, EDA, Modeling, Data Cleaning, 非结构化数据分析, Large Data Sets, Data Gathering, Spreadsheets, 机器学习自动化, Amazon弹性容器服务(Amazon ECS), Data Processing, Product Management, Amazon EC2, Back-end Development, Azure Cosmos DB, GitHub, Azure Functions, Azure Blobs, Scrapy, 大型语言模型(llm), Regression Modeling, Language Models, FastAPI, Containerization, Vertex, System Architecture, Product Roadmaps, Product Strategy, Team Leadership, Project Management, Azure机器学习, Pytest, Unit Testing, Statistical Modeling, NoSQL, 面向对象编程(OOP), Research, Cloud Computing, 无监督欺诈检测, 无监督学习, Supervised Learning, Open-source LLMs

数据科学家| ML专家

2023 - 2024
IISD实验湖区公司-主要
  • 利用气象数据开发了一个模型来预测一个湖泊冰融化的日期. 预测结果与实际融冰日期相差不到一天.
  • 使用boost、bagging和其他算法来提高性能.
  • 使用React创建了一个仪表板来显示模型预测和性能.
Technologies: 机器学习,数据科学,Python, PostgreSQL,亚马逊网络服务(AWS), React, Gradient Boosting, Scikit-learn, Google Earth, Statistical Modeling, 面向对象编程(OOP), Cloud Computing, Generative AI, Supervised Learning

Data Scientist

2023 - 2023
独立研究小组
  • 创建了一个模拟来模拟不同经济参与者(公司)之间的相互作用, employees, 非经济参与者, etc.).
  • 通过马尔可夫链模拟来了解不同初始状态和干预的影响.
  • 创建输出可视化和统计数据来测试假设.
技术:数据科学, Agent-based Modeling, R, Python, 马尔科夫链蒙特卡罗(MCMC)算法, 蒙特卡罗模拟, Simulations, 无监督学习

AI/ML Developer

2023 - 2023
美国的解释
  • 开发了一个实时翻译API,可以跨任何语言将语音转换为语音.
  • 在Django中构建了一个后端来处理流音频数据,并返回翻译后的音频数据和转录. 后端还处理了会议创建和会议加入.
  • 在React中创建了一个前端,并使用RecordRTC捕获音频. 建立WebSocket连接,允许音频流到后端.
  • 部署在Azure服务的前端和后端.
  • 集成多种翻译和语音生成服务.
Technologies: Python, 人工智能(AI), Machine Learning, Text to Speech (TTS), Speech to Text, 自然语言处理(NLP), React, Azure Text to Speech, Elementor, Django, Azure, OpenAI, WebSockets, TypeScript, JavaScript, RecordRTC, Voice Recognition, Language Models, Prompt Engineering, System Architecture, 新产品开发, 面向对象编程(OOP), Cloud Computing, Generative AI

AI /毫升专家/顾问

2023 - 2023
Harbor
  • 是否促使工程改进LLM模型预测.
  • 将开源法学硕士与封闭模型进行比较.
  • 公司基础设施上的自托管开源法学硕士.
  • 在Python中构建一个提示测试框架来比较和改进提示.
技术:OpenAI GPT-3 API, GPT, OpenAI GPT-4 API, 生成预训练变压器(GPT), 生成预训练变压器3 (GPT-3), AIOps, 机器学习操作(MLOps), 自然语言处理(NLP), 图形处理器(GPU), AI Design, Amazon SageMaker, Hugging Face, ChatGPT, Amazon EC2, Back-end Development, GitHub, LangChain, Pinecone, 大型语言模型(llm), OpenAI, LlamaIndex, Language Models, Prompt Engineering, Containerization, System Architecture, Project Management, 检索增强生成(RAG), Llama 2, NoSQL, 面向对象编程(OOP), Research, Cloud Computing, Generative AI, Open-source LLMs

AI/ML Engineer

2023 - 2023
Grown Unknown, LLC
  • 开发使用OpenAI api生成定制父母建议的提示.
  • 向提示添加上下文,以调整输出的语气.
  • 将OpenAI与其他选项进行比较,制定未来产品开发计划.
Technologies: Python, Machine Learning, Language Models, OpenAI GPT-4 API, OpenAI GPT-3 API, GPT, Data Scientist, Language Learning, Generative Systems, 自然语言处理(NLP), ChatGPT, 大型语言模型(llm), OpenAI, Prompt Engineering, System Architecture, Generative AI

机器学习专家

2023 - 2023
AmpVis Ltd.
  • 建议客户构建MVP,包括所需的所有技术步骤.
  • 决定团队结构来处理不同的产品决策.
  • 为其他技术职位的招聘决策提供咨询.
Technologies: Python, Machine Learning, 人工智能(AI), Data Science, APIs, Google Vision API, Amazon Rekognition, Programming, Cloud, Models, Data Scientist, Generative Systems, Deep Learning, 大型语言模型(llm), Product Roadmaps, Product Strategy

Data Scientist

2023 - 2023
NewCloud Medical LLC
  • 构建了一个Looker Studio仪表板,以显示基于过滤器的数据和汇总统计信息.
  • 在Looker Studio中添加可视化功能,以从数据中生成见解.
  • 创建了根据所选字段动态更新的仪表板视图.
Technologies: Python, PDF Scraping, Scraping, Databases, Looker, Programming, Language Models, GPT, Data Cleaning, Data Scientist, Spreadsheets, Data Processing, 大型语言模型(llm)

Research Coordinator

2015 - 2016
JustJobs Network
  • 建立内部数据管理系统来跟踪数据集的版本.
  • 领导了印度职业培训和技能建设项目的研究. Led data collection and analysis; published a findings report.
  • 设计统计学和R培训模块,用于新员工培训.
Technologies: Python, R, Data Analytics, Data Visualization, Data Mining, Web Scraping, Data Reporting, Data Analysis, Statistics, Statistical Analysis, Automation Scripting, Scripting, Data Scientist, Regression, Scraping, Git, Predictive Modeling, Visualization, Automation, Mathematics, Linear Regression, 数据驱动的决策, Decision Modeling, Programming, Models, 探索性数据分析, EDA, Modeling, Data Cleaning, 非结构化数据分析, Data Gathering, Spreadsheets, Data Processing, Regression Modeling, Project Management, Statistical Modeling, Research, Supervised Learning

Consultant

2014 - 2015
World Bank Group
  • 监督全州范围内4500个个人和家庭调查的数据收集.
  • 建立模型以确定影响青少年教育和劳动力市场结果的因素.
  • 参与研究成果的传播.
Technologies: R, Data Science, Data Analytics, Data Visualization, Data Mining, Data Reporting, Data Analysis, Statistics, Statistical Analysis, Automation Scripting, Scripting, Regression, Git, Predictive Modeling, Visualization, ETL, Mathematics, Linear Regression, 数据驱动的决策, Decision Modeling, Programming, Models, 探索性数据分析, EDA, Modeling, Data Cleaning, 非结构化数据分析, Data Gathering, Spreadsheets, Data Processing, Regression Modeling, Project Management, Statistical Modeling, Research, 无监督学习, Supervised Learning

高级研究助理

2012 - 2014
小额信贷研究中心
  • 管理两项随机对照试验,研究印度金融准入的影响.
  • 培训和监督一个由30名成员组成的实地小组,在四个地区进行1 700项个人调查.
  • 使用Open Data Kit和SurveyCTO设计并实施了6份电子问卷,并构建了调查数据的后端.
Technologies: STATA, Survey Design, Open Data Kit, Data Visualization, Data Mining, Data Reporting, Data Analysis, Causal Inference, Statistics, Statistical Analysis, Automation Scripting, Regression, Visualization, Mathematics, Linear Regression, 数据驱动的决策, Models, 探索性数据分析, EDA, Modeling, Data Cleaning, 非结构化数据分析, Data Gathering, Spreadsheets, Data Processing, Regression Modeling, Project Management, Statistical Modeling, Research, 无监督学习, Supervised Learning

自动贷款平台

http://monsoonfintech.com/thoth/
建立了一个AutoML平台,从贷款人那里获取数据,并生成最先进的机器学习模型. 支持传统财务数据和替代(短信,手机等).) data.

该平台为新应用程序生成模型,并帮助收集运行贷款. 这是作为SaaS产品提供的.

为贷款人定制机器学习模型

http://monsoonfintech.com/
管理一个由开发人员和数据科学家组成的团队,为贷款人构建模型. 这包括预测贷款申请风险的模型, 金融产品推荐引擎, 和营销模式接触,以确定目标客户.

为印度最大的银行制造并交付模型. 这使得拖欠率降低了30%,贷款批准率提高了25%.

世界银行报告

http://documents.worldbank.org/en/publication/documents-reports/documentdetail/866381523450216235/a-window-of-opportunity-a-diagnostic-of-adolescent-girls-and-young-women-s-socio-economic-empowerment-in-jharkhand-india
与世界银行密切合作,确定重大挑战, 伴随着关键的改革, 贾坎德邦的青春期女孩, India were facing.

我的工作包括实验设计、数据收集、分析和建模. 我还负责报告的传播和与主要利益相关者的沟通.

Languages

Python, HTML, R, SQL, TypeScript, CSS, JavaScript

Frameworks

Django, Django REST框架,Bootstrap, Material UI, LlamaIndex, Angular, Flask, Scrapy

Libraries/APIs

Pandas, XGBoost, Scikit-learn, REST APIs, NumPy, Beautiful Soup, Sockets, Google Vision API, Amazon Rekognition, React, RecordRTC

Tools

Amazon SageMaker, ChatGPT, Git, Spreadsheets, Amazon弹性容器服务(Amazon ECS), GitHub, Azure机器学习, Pytest, STATA, Open Data Kit, Azure ML Studio, Looker, 命名实体识别(NER)

Paradigms

Data Science, Automation, 对象关系映射(ORM), 面向对象编程(OOP), Agile, Microservices, 敏捷软件开发, ETL, 需求分析, Unit Testing, DevOps, Agent-based Modeling

Platforms

Jupyter Notebook, AWS Lambda, Amazon EC2, Docker, 亚马逊网络服务(AWS), Azure, Azure Functions, Kubernetes, 谷歌云平台(GCP)

Storage

MySQL, Amazon S3 (AWS S3), PostgreSQL, MongoDB, Databases, Database Architecture, Azure Cosmos DB, Azure Blobs, NoSQL, Amazon DynamoDB

Industry Expertise

项目管理、银行 & Finance

Other

Machine Learning, Data Analytics, Data Mining, Web Scraping, 人工智能(AI), Data Analysis, Statistics, Statistical Analysis, Predictive Analytics, APIs, Architecture, Automation Scripting, Scripting, Decision Trees, Data Scientist, 自然语言处理(NLP), Regression, PDF Scraping, Scraping, Back-end, 软件架构, 不良贷款(NPL), Data Scraping, Predictive Modeling, 客户细分, Visualization, 完整的开发, API Integration, Software Development, PyPDF2, Advisory, Technology Strategy & Architecture, Web Development, CTO, Technical Leadership, 生成预训练变压器(GPT), OpenAI GPT-3 API, 最小可行产品(MVP), Startups, Regular Expressions, Linear Regression, 数据驱动的决策, Programming, Integration, Models, GPT, 探索性数据分析, EDA, Modeling, Data Cleaning, 非结构化数据分析, Large Data Sets, Data Gathering, 机器学习自动化, Data Processing, Back-end Development, Regression Modeling, 大型语言模型(llm), OpenAI, Prompt Engineering, System Architecture, Product Roadmaps, Product Strategy, 新产品开发, Team Leadership, Statistical Modeling, 无监督学习, Supervised Learning, 机器学习操作(MLOps), Data Visualization, Data Reporting, Time Series, Time Series Analysis, Real-time Data, Leadership, 推荐系统, Serverless, AI Design, Full-stack, 解决方案架构, Data Structures, 生成预训练变压器3 (GPT-3), Mathematics, Task Scheduling, OpenAI GPT-4 API, Decision Modeling, Neural Networks, Cloud, Language Models, Language Learning, Generative Systems, Product Management, LangChain, Speech to Text, Voice Recognition, FastAPI, Containerization, 检索增强生成(RAG), Llama 2, Research, Cloud Computing, 无监督欺诈检测, Generative AI, Open-source LLMs, Survey Design, SaaS, Optimization, Financial Modeling, Causal Inference, openpyxl, User Interface (UI), Deep Learning, AIOps, 图形处理器(GPU), Hugging Face, Pinecone, Text to Speech (TTS), Azure Text to Speech, Elementor, WebSockets, Vertex, Gradient Boosting, Google Earth, 马尔科夫链蒙特卡罗(MCMC)算法, 蒙特卡罗模拟, Simulations, Computer Vision, Object Detection, Text Detection

2008 - 2012

经济学和统计学学士学位

卡内基梅隆大学-匹兹堡,宾夕法尼亚州,美国

有效的合作

如何使用Toptal

在数小时内,而不是数周或数月,我们的网络将为您直接匹配全球行业专家.

1

Share your needs

在与Toptal领域专家的电话中讨论您的需求并细化您的范围.
2

Choose your talent

在24小时内获得专业匹配人才的简短列表,以进行审查,面试和选择.
3

开始你的无风险人才试验

与你选择的人才一起工作,试用最多两周. 只有当你决定雇佣他们时才付钱.

对顶尖人才的需求很大.

Start hiring