Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
@davidberenstein1957
davidberenstein1957
Follow
View davidberenstein1957's full-sized avatar
🦦

David Berenstein davidberenstein1957

🦦

Organizations

@argilla-io@huggingface

Block or report davidberenstein1957

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more aboutblocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more aboutreporting abuse.

Report abuse

From failing to study medicine ➡️ BSc industrial engineer ➡️ MSc computer scientist.
Life can be strange, so better enjoy it.
I´m sure I do by: 👨🏽‍🍳 Cooking, 👨🏽‍💻 Coding, 🏆 Committing.

Conferences/Presentations 📖

  • Synthetic Data - Weaviate Podcast #118! -podcast
  • SmolAgents - FromBells and Whistles to Agents and Tools -slidesvideo
  • No data? No problem! - synthetic data to the rescue -slides
  • Practical AI Podcast - Towards high-quality (maybe synthetic) datasets -podcast
  • Code Together Podcast Intel Software - Scaling LLM Datasets with Less Effort Using Argilla -video
  • Mastering LLMs - Creating, curating, and cleaning data for LLMs -slidesvideo
  • 🧼 From GPU-poor to data-rich - data quality practices for LLM fine-tuning -slides
  • Deeplearning.ai LLM workshop - get started with Argilla for human- and distilabel for AI feedback -video
  • NLP Healthcare Summit 2023 - Smart Shortcuts for Bootstrapping a Healthcare NER Project -video
  • Anyscale Ray Europe Meetup - Smart shortcuts for Bootstrapping a Text Classification project -video

AI Code Content

Employers 👨🏽‍💻

  • Hugging Face 🤗 (2024-current) - The AI community building the future
  • Argilla (2022-2024) - data annotation and monitoring for enterprise NLP
  • Pandora Intelligence (2020-2022) - an independent intelligence company, specialized in security risks

Open source ⭐️

Maintainer 🤓

Contributions 🫱🏾‍🫲🏼

Volunteering 🌍

  • Bonfari - small to medium sustainable scale projects in Gambia 🇬🇲
  • 510 red-cross - occasional projects to improve humanitarian aid with data

Contacts

GmailLinkedInTwitter

PinnedLoading

  1. argilla-io/argillaargilla-io/argillaPublic

    Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

    Python 4.4k 420

  2. argilla-io/distilabelargilla-io/distilabelPublic

    Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

    Python 2.6k 188

  3. dataset-viberdataset-viberPublic

    Dataset Viber is your chill repo for data collection, annotation and vibe checks.

    Python 46 12

  4. concise-conceptsconcise-conceptsPublic

    This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.

    Python 244 14

  5. crosslingual-coreferencecrosslingual-coreferencePublic

    A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.

    Python 105 18

  6. spacy-setfitspacy-setfitPublic

    This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.

    Python 78 5


[8]ページ先頭

©2009-2025 Movatter.jp