Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

stratified-sampling

Here are 50 public repositories matching this topic...

mcs_kfold stands for "monte carlo stratified k fold". This library attempts to achieve equal distribution of discrete/categorical variables in all folds. The greatest advantage of this method is that it can be applied to multi-dimensional targets.

  • UpdatedSep 4, 2020
  • Python

Fast Online Triplet mining in Pytorch

  • UpdatedApr 2, 2020
  • Python

Predicts which telecom customers are likely to churn with 95% accuracy using real-world data features from usage, billing, and support data. Implements Sturges-based binning, one-hot encoding, stratified 80/20 train-test split, and a two-level ensemble pipeline with soft voting. Achieves 94.60% accuracy, 0.8968 AUC, 0.8675 precision, 0.7423 recall.

  • UpdatedJun 16, 2025
  • Python

An optimal stratified sample design for Commodity Flow Survey (CFS) based on Simulated Annealing and Genetic Algorithm. A script in Procedural PostgreSQL is used to generate a frame with 100,000 records based on publicly available data.

  • UpdatedFeb 3, 2021
  • R

Three business analytics case studies were undertaken, encompassing market basket analysis, customer segmentation, and campaign management. SAS Visual Data Mining and Machine Learning on SAS Viya was utilized to explore data and provide insights. A comprehensive report addressing both technical and business aspects was delivered.

  • UpdatedApr 15, 2024

Data consists of tweets scrapped using Twitter API. Objective is sentiment labelling using a lexicon approach, performing text pre-processing (such as language detection, tokenisation, normalisation, vectorisation), building pipelines for text classification models for sentiment analysis, followed by explainability of the final classifier

  • UpdatedApr 3, 2022
  • Jupyter Notebook

Data sampling library

  • UpdatedDec 19, 2024
  • Python

The objective is to analyze flight delays in the United States. Data from airlines, airports, and runways will be collected and processed. Machine learning models will be built using logistic regression, decision trees, and XGB classifiers. Visualizations will be created in Tableau, and Excel dashboards and SQL queries will be used for analysis.

  • UpdatedJun 21, 2023
  • Jupyter Notebook

Data-driven insights into netlabel download patterns.

  • UpdatedDec 11, 2024
  • R

This project focuses on applying advanced simulation methods for derivatives pricing. It includes Monte-Carlo, Variance Reduction Techniques, Distribution Sampling Methods, Euler Schemes, and Milstein Schemes.

  • UpdatedJul 9, 2024
  • Jupyter Notebook

Data sampling library

  • UpdatedDec 19, 2024
  • C++

Data sampling library

  • UpdatedJul 5, 2025
  • Java

Python package for stratifying, sampling, and estimating model performance with fewer annotations.

  • UpdatedMar 6, 2025
  • Python

A C library with Python bindings for efficient stratified random sampling from binary buffers or files.

  • UpdatedDec 10, 2022
  • C
ProfessorSurvey

Web scraper to get professor information, and a mass emailer that sends a website with a survey.

  • UpdatedJan 25, 2023
  • Jupyter Notebook

This repository contains Natural Language Processing Projects like Sarcasm Detection, Quora Insincere Questions Classification & Edgar Sentiment Analysis

  • UpdatedJul 14, 2020
  • Jupyter Notebook

Improve this page

Add a description, image, and links to thestratified-sampling topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thestratified-sampling topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp