This repository contains code and resources for detecting machine-generated text using the RoBERTa language model. It includes a Jupyter notebook and a paper submitted to SemEval-2024 Task 8.
Detecting machine-generated text has become increasingly important as automated content generation systems proliferate. This project addresses the problem by using RoBERTa, a pretrained transformer language model, to identify machine-generated text.
- notebook.ipynb: Jupyter notebook containing the code for text detection using RoBERTa.
To use the code provided in the notebook, follow these steps:
1. Clone the repository.
2. Install the necessary dependencies. You can use pip: `pip install -r requirements.txt`
3. Open and run the `notebook.ipynb` file in a Jupyter environment such as Google Colab. Follow the instructions and execute the cells to perform machine-generated text detection using RoBERTa.
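For orientation, the detection step can be sketched outside the notebook roughly as follows. This is a minimal illustration, not the notebook's actual code. It assumes the Hugging Face `transformers` and `torch` packages are installed, uses the public `roberta-base` checkpoint as a placeholder, and assumes class index 0 means human-written and index 1 means machine-generated. The helper names `softmax`, `label_from_logits`, and `load_roberta_scorer` are hypothetical.

```python
import math
from typing import Callable, List, Sequence, Tuple

# Assumed label order: index 0 = human-written, index 1 = machine-generated.
LABELS = ("human", "machine")


def softmax(logits: Sequence[float]) -> List[float]:
    """Convert raw logits to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def label_from_logits(
    logits: Sequence[float], labels: Sequence[str] = LABELS
) -> Tuple[str, float]:
    """Return the most likely label and its probability."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return labels[best], probs[best]


def load_roberta_scorer(model_name: str = "roberta-base") -> Callable[[str], List[float]]:
    """Build a text -> logits function from a RoBERTa checkpoint.

    Heavy dependencies are imported lazily so the helpers above work without them.
    Note: `roberta-base` ships without a trained classification head, so its
    predictions are meaningless until the model is fine-tuned on labeled data.
    """
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=2)
    model.eval()

    def score(text: str) -> List[float]:
        # Truncate long inputs to RoBERTa's 512-token limit.
        inputs = tokenizer(text, truncation=True, max_length=512, return_tensors="pt")
        with torch.no_grad():
            logits = model(**inputs).logits[0]
        return logits.tolist()

    return score
```

With a fine-tuned checkpoint in place of `roberta-base`, usage would look like `score = load_roberta_scorer("path/to/checkpoint")` followed by `label_from_logits(score("Some text to check."))`; the checkpoint path here is a placeholder.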
If you find this work valuable for your research, please consider citing the paper accompanying this repository and the original source for the RoBERTa model.
This project is licensed under the MIT License - see the LICENSE file for details.
We thank the developers of the RoBERTa language model and the organizers of SemEval-2024 Task 8 for providing the dataset and evaluation framework for this research.