Japanese Livedoor news corpus for huggingface datasets


shunk031/huggingface-datasets_livedoor-news-corpus


annotations_creators:
language: ja
language_creators: found
license: cc-by-nd-4.0
multilinguality: monolingual
pretty_name: livedoor-news-corpus
size_categories:
source_datasets:
tags:
task_categories: text-classification
task_ids: multi-class-classification



Dataset Description

Dataset Summary

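This corpus was built by collecting news articles from "livedoor News", operated by NHN Japan Corporation, that are covered by the Creative Commons license described below, and then removing HTML tags as far as possible.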
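The tooling used to remove HTML tags from the original articles is not described here. As a rough illustration of what tag stripping involves, the following minimal sketch uses Python's standard-library `HTMLParser`; the `TagStripper` class and `strip_tags` function are hypothetical names introduced for this example and are not part of the corpus or the loader.

```python
from html.parser import HTMLParser


class TagStripper(HTMLParser):
    """Collects only the text nodes of an HTML fragment (hypothetical helper)."""

    def __init__(self):
        # convert_charrefs=True turns entities such as &amp; into characters.
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_data(self, data):
        # Called for text between tags; tags themselves are ignored.
        # Note: text inside <script> or <style> would also be collected here.
        self.parts.append(data)


def strip_tags(html: str) -> str:
    """Return the text content of `html` with all tags removed."""
    stripper = TagStripper()
    stripper.feed(html)
    stripper.close()  # flush any buffered trailing text
    return "".join(stripper.parts)


if __name__ == "__main__":
    print(strip_tags("<p>Hello <b>world</b></p>"))
```

A simple approach like this leaves stray markup when the input is malformed, which is consistent with the "as far as possible" caveat above.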

Supported Tasks and Leaderboards

[More Information Needed]

Languages

[More Information Needed]

Dataset Structure

Data Instances

    from datasets import load_dataset

    dataset = load_dataset(
        "shunk031/livedoor-news-corpus",
        train_ratio=0.8,
        val_ratio=0.1,
        test_ratio=0.1,
        random_state=42,
        shuffle=True,
    )

    print(dataset)
    # DatasetDict({
    #     train: Dataset({
    #         features: ['url', 'date', 'title', 'content', 'category'],
    #         num_rows: 5894
    #     })
    #     validation: Dataset({
    #         features: ['url', 'date', 'title', 'content', 'category'],
    #         num_rows: 737
    #     })
    #     test: Dataset({
    #         features: ['url', 'date', 'title', 'content', 'category'],
    #         num_rows: 736
    #     })
    # })
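The three splits above total 7,367 rows (5,894 + 737 + 736). The loader's internal splitting logic is not shown in this README, so the following is only a sketch of one scheme that reproduces those counts: it assumes the train and validation sizes are rounded from the ratios and the test split takes the remainder. The function names `split_sizes` and `split_indices` are hypothetical.

```python
import random


def split_sizes(n_rows, train_ratio, val_ratio):
    """Return (train, validation, test) sizes; test takes the remainder.

    Assumption: train and validation sizes are rounded to the nearest integer.
    """
    n_train = round(n_rows * train_ratio)
    n_val = round(n_rows * val_ratio)
    return n_train, n_val, n_rows - n_train - n_val


def split_indices(n_rows, train_ratio, val_ratio, seed=42, shuffle=True):
    """Partition row indices 0..n_rows-1 into three disjoint lists."""
    indices = list(range(n_rows))
    if shuffle:
        # A seeded generator keeps the split reproducible across runs.
        random.Random(seed).shuffle(indices)
    n_train, n_val, _ = split_sizes(n_rows, train_ratio, val_ratio)
    return (
        indices[:n_train],
        indices[n_train:n_train + n_val],
        indices[n_train + n_val:],
    )


if __name__ == "__main__":
    print(split_sizes(7367, 0.8, 0.1))  # (5894, 737, 736)
```

Because 80% and 10% of 7,367 are not whole numbers, the validation and test splits differ by one row even though both ratios are 0.1.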

Data Fields

[More Information Needed]

Data Splits

[More Information Needed]

Dataset Creation

Curation Rationale

[More Information Needed]

Source Data

Initial Data Collection and Normalization

[More Information Needed]

Who are the source language producers?

[More Information Needed]

Annotations

Annotation process

[More Information Needed]

Who are the annotators?

[More Information Needed]

Personal and Sensitive Information

[More Information Needed]

Considerations for Using the Data

Social Impact of Dataset

[More Information Needed]

Discussion of Biases

[More Information Needed]

Other Known Limitations

[More Information Needed]

Additional Information

Dataset Curators

[More Information Needed]

Licensing Information

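Each article file is covered by the Creative Commons "Attribution-NoDerivs" license. Credit requirements differ by news category, so refer to the LICENSE.txt in each subdirectory of the extracted download. livedoor is a registered trademark of NHN Japan Corporation.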

Citation Information

[More Information Needed]

Contributions

Thanks to RONDHUIT Co., Ltd. for creating this dataset.
