Commita1cddf5

authored and

committed

GITBOOK-23: Data Split

1 parente6b552b commita1cddf5Copy full SHA for a1cddf5

File tree

+19

-0

lines changed

+19

-0

lines changed

81.9 KB

102 KB

Lines changed: 19 additions & 0 deletions

Original file line number	Diff line number	Diff line change
`@@ -1,2 +1,21 @@`
`1`	`1`	`#2. Data Split`
`2`	`2`
	`3`	`+`
	`4`	`+`
	`5`	`+<figure><imgsrc="../.gitbook/assets/image (146).png"alt=""width="211"><figcaption></figcaption></figure>`
	`6`	`+`
	`7`	`+1. Click on_Data Split_ in the_Machine Learning_ category.`
	`8`	`+`
	`9`	`+`
	`10`	`+`
	`11`	`+<figure><imgsrc="../.gitbook/assets/image (147).png"alt=""width="563"><figcaption></figcaption></figure>`
	`12`	`+`
	`13`	`+2._Input Data_: Choose whether the target data is included in the input data. If it is, select_Feature Data_ and_Target Data_ separately. You can also select specific columns from one dataset using the_funnel icon_.`
	`14`	`+3._Test Size_: Select the percentage of input data to use for testing purposes.`
	`15`	`+4._Random State_: Generate the same random state, ensuring consistent data splits each time. (If not set, data will be randomly split differently each time.)`
	`16`	`+5._Shuffle_: Shuffle the data randomly to prevent the model from relying on the order of the data, thereby reducing bias and improving generalization performance.`
	`17`	`+6._Stratify_: Maintain class ratios when splitting the data to prevent over-representation of certain classes (Classification).`
	`18`	`+7._Allocate to_: Assign variable names to the split data.`
	`19`	`+8._Code View_: Preview the code that will be output.`
	`20`	`+9._Run_: Execute the code.`
	`21`	`+`

Comments

(0)