PrincySinghal/Html-code-generation-from-LLM
Fine-tuning Falcon 7B for HTML code generation. Falcon 7B was selected for its performance on complex reasoning benchmarks such as ARC and GSM8K and for its compatibility with the available computational resources.
Used the https://huggingface.co/datasets/ttbui/html_alpaca dataset (636 rows), where each row contains:
- Instruction: the user prompt (text)
- Input: additional context required by the instruction, such as HTML code or data points (text + code)
- Response: left empty
- Output: the expected HTML code
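As a rough illustration, the dataset can be loaded and flattened into single training prompts as sketched below; the split name, column names, and the Alpaca-style template are assumptions and may differ from the repository's actual preprocessing.

```python
from datasets import load_dataset

# Load the 636-row HTML Alpaca dataset from the Hugging Face Hub
# (split name assumed to be "train").
dataset = load_dataset("ttbui/html_alpaca", split="train")

def format_example(example):
    """Flatten one row into a single prompt string (template is an assumption)."""
    if example["input"]:
        return {
            "text": (
                f"### Instruction:\n{example['instruction']}\n\n"
                f"### Input:\n{example['input']}\n\n"
                f"### Response:\n{example['output']}"
            )
        }
    return {
        "text": (
            f"### Instruction:\n{example['instruction']}\n\n"
            f"### Response:\n{example['output']}"
        )
    }

dataset = dataset.map(format_example)
print(dataset[0]["text"][:300])
```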
The project covered the following stages:
- Model selection
- Dataset preparation and preprocessing
- Model fine-tuning script (setting hyperparameters, choosing fine-tuning techniques and regularization)
- Model evaluation
- API development to serve the model
Key challenges:
- Understanding and implementing Parameter-Efficient Fine-Tuning (PEFT).
- Managing the computational complexity and memory limitations of large models.
- Ensuring reproducibility and consistency across training runs.
- Dealing with long training times and optimizing model runtime.
These were addressed by:
- Completing training and evaluation without purchasing Colab Pro; an out-of-memory error during training was resolved by loading the fine-tuned model in a different way rather than loading the base model from scratch.
- Adopting PEFT techniques such as LoRA (see the sketch after this list).
- Utilizing quantization and model sharding to manage memory usage.
- Setting a random seed for the train-test split to ensure reproducibility.
- Implementing mixed-precision training, early stopping, and learning-rate scheduling to improve convergence speed and work within GPU memory limits.
- Applying regularization techniques such as dropout and the LoRA scaling factor.
- Carefully setting up the training arguments to balance performance and resource usage.
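A minimal sketch of the 4-bit quantization plus LoRA setup described in the list above, using transformers, bitsandbytes, and peft. The base checkpoint name, LoRA rank, scaling factor, dropout, and target modules here are assumptions rather than the repository's exact values.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

BASE_MODEL = "ybelkada/falcon-7b-sharded-bf16"  # sharded Falcon 7B checkpoint (assumed)

# 4-bit quantization keeps the 7B base model within free-Colab GPU memory.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
model = AutoModelForCausalLM.from_pretrained(
    BASE_MODEL, quantization_config=bnb_config, device_map="auto"
)
model = prepare_model_for_kbit_training(model)

# LoRA adapter: the rank, scaling factor (lora_alpha), and dropout act as the
# regularization knobs mentioned above; the values here are assumptions.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=["query_key_value"],  # Falcon's fused attention projection
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the LoRA weights are trainable
```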
The training hyperparameters were as follows (they map onto the `TrainingArguments` sketch below):
1. learning_rate: 0.0002
2. train_batch_size: 2
3. eval_batch_size: 8
4. seed: 42
5. gradient_accumulation_steps: 2
6. total_train_batch_size: 4
7. optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
8. lr_scheduler_type: cosine
9. lr_scheduler_warmup_ratio: 0.03
10. training_steps: 320
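Expressed with Hugging Face `transformers`, these settings correspond roughly to the following `TrainingArguments`; the output directory, logging interval, and mixed-precision flag are assumptions (the list above only specifies the items it names).

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="./falcon-7b-html-finetune",  # assumed
    per_device_train_batch_size=2,           # x 2 accumulation steps = total batch size 4
    per_device_eval_batch_size=8,
    gradient_accumulation_steps=2,
    learning_rate=2e-4,
    max_steps=320,
    lr_scheduler_type="cosine",
    warmup_ratio=0.03,
    seed=42,
    fp16=True,                               # mixed precision (assumed)
    logging_steps=10,                        # assumed
    # Adam betas (0.9, 0.999) and epsilon 1e-08 are the TrainingArguments defaults.
)
```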
Model link: https://huggingface.co/PrincySinghal991/falcon-7b-sharded-bf16-finetuned-html-code-generation
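Assuming the Hub repository stores a LoRA adapter on top of the sharded Falcon 7B base (an assumption based on the training setup above), the fine-tuned model can be loaded for inference and served through a small API roughly as follows; the base checkpoint, prompt template, and endpoint shape are illustrative, not the repository's actual serving code.

```python
import torch
from fastapi import FastAPI
from pydantic import BaseModel
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

BASE_MODEL = "ybelkada/falcon-7b-sharded-bf16"  # assumed base checkpoint
ADAPTER = "PrincySinghal991/falcon-7b-sharded-bf16-finetuned-html-code-generation"

tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
base = AutoModelForCausalLM.from_pretrained(
    BASE_MODEL, torch_dtype=torch.bfloat16, device_map="auto"
)
# Attach the fine-tuned adapter instead of reloading or re-training from scratch.
model = PeftModel.from_pretrained(base, ADAPTER)
model.eval()

app = FastAPI()

class Prompt(BaseModel):
    instruction: str

@app.post("/generate")
def generate(prompt: Prompt):
    text = f"### Instruction:\n{prompt.instruction}\n\n### Response:\n"
    inputs = tokenizer(text, return_tensors="pt").to(base.device)
    with torch.no_grad():
        output = model.generate(**inputs, max_new_tokens=256)
    return {"html": tokenizer.decode(output[0], skip_special_tokens=True)}
```

Run with, for example, `uvicorn app:app` and POST a JSON body such as `{"instruction": "Create a simple login form"}` to `/generate`.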
Evaluation results:
- BLEU score: 0.01782
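For reference, a BLEU score like this one can be computed with the `evaluate` library as sketched below; the predictions and references here are placeholders, and the repository's actual evaluation loop may differ.

```python
import evaluate

bleu = evaluate.load("bleu")

# Placeholder data: model-generated HTML vs. the dataset's expected HTML.
predictions = ["<html><body><h1>Hello</h1></body></html>"]
references = [["<html><body><h1>Hello, world!</h1></body></html>"]]

result = bleu.compute(predictions=predictions, references=references)
print(result["bleu"])
```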
Future work:
- Exploring LLMs better suited for code generation.
- Hyperparameter tuning to improve the low evaluation score.