πCSV File Upload
Overview
For users looking to quickly prototype or train models without setting up a direct database connection, PigeonsAI allows the creation of training datasets directly from CSV files. This method is straightforward and does not require complex configurations, making it ideal for initial testing and smaller datasets.
Requirements
To create a training dataset from a CSV file, ensure that your file meets the following criteria:
The CSV file should be structured with clearly defined columns.
The file must be accessible from the location where PigeonsAI is running.
The maximum size for the CSV file is 3 GB, which accommodates substantial data but ensures processing efficiency.
Creating a Training Dataset
Hereβs how you can create a training dataset using a CSV file in PigeonsAI:
Example Output
The training set URI outputted from the code above will be used to train a model.
Advantages of Using CSV Files for Prototyping
Quick Setup: No need for database credentials or configurations. Simply provide the path to the CSV file.
Flexibility: Easily test different datasets by switching out CSV files as needed.
Simplicity: Ideal for users who may not have the technical expertise to manage database connections.
Limitations
While using CSV files is convenient, it's important to consider that:
Only datasets up to 50 MB in size can be processed, which may not be suitable for very large data sets.
Data must be manually updated in the CSV file, unlike database connections that can pull updated data automatically.
Last updated