pix2code PyTorch implementation

A machine learning (ML) model that translates a screenshot of a website into its corresponding code representation, inspired by the pix2code problem and dataset. This was created as a university project by Timo Angerer and Marvin Knoll (@marvinknoll).

For general information about the problem, architecture and implementation, see the project documentation.

To set up the project and use the model, refer to the next section of this document.

Setup and run the model

Follow the steps below to train and evaluate the model locally on your machine. Note: This project uses Python 3.8.8 and PyTorch 1.8.1.

Clone the repository

Install the dependencies

Run the following command to create a new conda environment named pix2code with the required dependencies:

 conda env create -f environment.yml

or install all the dependencies manually:

 conda install -y -c pytorch pytorch=1.8.1 torchvision=0.9.1 cudatoolkit=10.2 
 conda install -y -c conda-forge tqdm=4.60.0 pillow=8.2.0 nltk=3.6.1
 conda install -y -c conda-forge nb_conda_kernels=2.3.1 jupyterlab=3.0.12
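After installing, it can help to confirm that the active environment really contains the pinned versions. The following is a minimal sketch, not part of the repository: the helper name check_versions is hypothetical, and it assumes the conda packages register themselves under the distribution names torch, torchvision, tqdm, Pillow, and nltk.

```python
from importlib import metadata

# Versions taken from the manual install commands above.
# The distribution names on the left are an assumption.
EXPECTED = {
    "torch": "1.8.1",
    "torchvision": "0.9.1",
    "tqdm": "4.60.0",
    "Pillow": "8.2.0",
    "nltk": "3.6.1",
}


def check_versions(expected):
    """Return {package: (expected, installed or None)} for every mismatch."""
    mismatches = {}
    for name, wanted in expected.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None
        # Prefix comparison tolerates local build suffixes such as "+cu102".
        if installed is None or not installed.startswith(wanted):
            mismatches[name] = (wanted, installed)
    return mismatches
```

Calling check_versions(EXPECTED) inside the pix2code environment should return an empty dictionary if every package matches; any entry in the result names a package that is missing or has a different version.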

Download the dataset

Create a new folder data inside the project root folder, download the dataset, and extract it into the data folder.

The dataset used is the pix2code dataset. You can download it from one of the following links: Google Drive, GitHub.

This is what the folder structure of the data folder should look like:
```
 data
 ├── ...
 └── web
     └── all_data
         ├── AF4840B2-2B9F-4ED0-A58D-E260B14858E1.gui
         └── ...
```
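The extracted folder can be sanity-checked before continuing. This sketch is not part of the repository. It assumes that every .gui file is accompanied by a screenshot with the same name and a .png extension; the tree above only shows .gui files, so treat the .png pairing as an assumption. The helper name find_unpaired is hypothetical.

```python
from pathlib import Path


def find_unpaired(data_dir):
    """Return the IDs of .gui files that have no .png file with the same name."""
    data_dir = Path(data_dir)
    if not data_dir.is_dir():
        raise FileNotFoundError(f"data folder not found: {data_dir}")
    gui_ids = {p.stem for p in data_dir.glob("*.gui")}
    png_ids = {p.stem for p in data_dir.glob("*.png")}  # assumed screenshot format
    return sorted(gui_ids - png_ids)
```

For example, find_unpaired("./data/web/all_data") should return an empty list when the archive was extracted completely.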
The pix2code dataset contains three sub-datasets. The following steps are only concerned with the web dataset.

Split the dataset

The train.py and evaluate.py scripts assume the existence of three data split files train_dataset.txt, test_dataset.txt, and validation_dataset.txt, each containing the IDs of the data examples for the respective data split. The data split files have to be at the same folder level as the folder containing the data examples.

Run split_data.py to generate the data split files for the web dataset:
```
 python split_data.py --data_path=./data/web/all_data
```
This is what the data folder should look like up to this point:
```
 data
 ├── ...
 └── web
     ├── all_data
     │   ├── AF4840B2-2B9F-4ED0-A58D-E260B14858E1.gui
     │   └── ...
     ├── test_dataset.txt
     ├── train_dataset.txt
     └── validation_dataset.txt
```
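To make the expected split files concrete, here is a minimal sketch of how such files could be produced. It is not the project's split_data.py. The 80/10/10 ratio, the fixed seed, the one-ID-per-line file format, and the function names split_ids and write_splits are all assumptions made for illustration.

```python
import random
from pathlib import Path


def split_ids(data_dir, train_frac=0.8, val_frac=0.1, seed=0):
    """Shuffle the example IDs and cut them into train/validation/test lists."""
    # An example ID is assumed to be the .gui file name without its extension.
    ids = sorted(p.stem for p in Path(data_dir).glob("*.gui"))
    random.Random(seed).shuffle(ids)
    n_train = int(len(ids) * train_frac)
    n_val = int(len(ids) * val_frac)
    return {
        "train": ids[:n_train],
        "validation": ids[n_train:n_train + n_val],
        "test": ids[n_train + n_val:],  # whatever remains
    }


def write_splits(data_dir, splits):
    """Write <split>_dataset.txt next to the data folder, one ID per line."""
    out_dir = Path(data_dir).resolve().parent
    for name, ids in splits.items():
        (out_dir / f"{name}_dataset.txt").write_text("\n".join(ids) + "\n")
```

Writing the files into the parent of the data folder mirrors the requirement that the split files sit at the same level as the folder containing the examples.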

Create the vocabulary file

You need to generate a vocab.txt file that contains all the tokens the model should be able to predict, separated by whitespace.

Run build_vocab.py to generate a vocabulary file based on the tokens that appear in the specified dataset.

 python build_vocab.py --data_path=./data/web/all_data

This is what the data folder should look like up to this point:

 data
 ├── ...
 └── web
     ├── all_data
     │   ├── AF4840B2-2B9F-4ED0-A58D-E260B14858E1.gui
     │   └── ...
     ├── test_dataset.txt
     ├── train_dataset.txt
     ├── validation_dataset.txt
     └── vocab.txt
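As an illustration of what a whitespace-separated vocabulary file contains, the sketch below collects every distinct token from the .gui files. It is not the project's build_vocab.py. It assumes tokens are obtained by plain whitespace splitting, so punctuation attached to a token stays attached, and it adds no special tokens such as start, end, or padding markers. The function names collect_tokens and write_vocab are hypothetical.

```python
from pathlib import Path


def collect_tokens(data_dir):
    """Return the sorted set of whitespace-separated tokens in all .gui files."""
    tokens = set()
    for gui_file in Path(data_dir).glob("*.gui"):
        tokens.update(gui_file.read_text().split())
    return sorted(tokens)


def write_vocab(data_dir, tokens):
    """Write the tokens, separated by single spaces, to vocab.txt next to the data folder."""
    vocab_file = Path(data_dir).resolve().parent / "vocab.txt"
    vocab_file.write_text(" ".join(tokens))
    return vocab_file
```

The output location matches the tree above, where vocab.txt sits beside the split files rather than inside all_data.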

Train the model

Run the following command to train the model:

 python train.py --data_path=./data/web/all_data --epochs=15 --save_after_epochs=5 --batch_size=4 --split=train

Evaluate the model

Run the following command to evaluate the model:
```
 python evaluate.py --data_path=./data/web/all_data --model_file_path=<path-to-model-file> --split=validation --viz
```
To visualize the results of the model evaluation, run the evaluation script with the --viz flag and then follow the steps in the visualize_inference.ipynb notebook.

References & credits

Tony Beltramelli for the original pix2code paper and the dataset.
Image captioning tutorials: basic idea of image captioning, image captioning in PyTorch, image captioning in TensorFlow.
The Show, Attend and Tell paper for image captioning.
