Examples 

The best way to understand how this works, and what is useful for, it’s probably with some examples, so here we go:

CSV input format 

The CSV should be just a simple file, with no headers, where each line represents an attribute of a table.

For example, take the following data:

Just 1 table with multiple items 

Okay, that table it’s kind of awful. Really, really unnormalized.

A sligthly better version could be:

Table (recipes)
recipe_id	name	ingredient
1	Chimichurri	1/2 Cup Oil
1	Chimichurri	2 tablespoons red wine vinegar
1	Chimichurri	1/2 cup finely chopped parsley
1	Chimichurri	1 tablespoon finely chopped chili
1	Chimichurri	1 teaspoon salt
1	Chimichurri	(…)

Since the information that the CSV file contains are the columns (attributes) names, the table to where they belong and the sql syntax that generates them, the ONLY difference in this file would be the name of the third column: as we now denormalized the table a little bit, we now have an entry (row) for each ingredient, so we changed ingredients by ingredient (without the last “s”)

recipes, recipe_id, integer primary key
recipes, name, text not null
recipes, ingredient, text

So the other files would change in a similar way.

2 tables 

If we go a little bit further in normalization, the first thing we should do, would be to split the recipe’s name from the ingredients, right?

Table 1 (recipes)
recipe_id (primary_key)	name
1	Chimichurri

Table 2 (ingredients)
ingredient_id (primary_key)	ingredient	recipe_id (foreign_key)
1	1/2 Cup Oil	1
2	2 tablespoons red wine vinegar	1
3	1/2 cup finely chopped parsley	1
4	1 tablespoon finely chopped chili	1
5	1 teaspoon salt	1
6	(…)	(…)

Now the recipe_id attribute is not only the primary_key of the first table (recipes), but it’s also the foreign_key of the second table (ingredients)

The CSV (example3.csv) now does have some ‘major’ changes:

recipes, recipe_id, integer primary key
recipes, name, text not null
ingredients, ingredient_id, integer primary key
ingredients, ingredient, text not null
ingredients, recipe_id, integer foreign key (recipes)

And the ERD diagram (example3.uml) is transformed to:

See in PlantUML server (and the script’s output)

Several tables 

If we go a little bit further in normalizing this scheme, we can break these tables a little bit more into:

Table 1 (recipes)
recipe_id (primary_key)	name	type_id (foreign_key)
1	Chimichurri	2
2	Criolla	2
3	Flan	5

Table 2 (recipe_type)
type_id (primary_key)	type_name
1	Starter
2	Dressing
3	Main course
4	Side dish
5	Dessert

Table 3 (recipe_ingredients)
recipe_id (foreign_key)	ingredient_id (foreign_key)	quantity	unit_of_measurement	preparation
1	1	1/2	Cup
1	2	2	tablespoons
1	3	1/2	Cup	Finely chopped

Table 4 (ingredient_list)
ingredient_id (primary_key)	ingredient_name
1	Sunflower Oil
2	Red wine vinegar
3	Parsley
4	Chili
5	Salt

To generate an ERD diagram, the input could be (example4.csv):

recipes, recipe_id, integer primary key
recipes, name, text
recipes, type_id, integer foreign key (recipe_type)
recipe_type, type_id, integer primary key
recipe_type, type_name, text not null
recipe_ingredients, recipe_id, integer foreign key (recipes)
recipe_ingredients, ingredient_id, integer foreign key (ingredient_list)
recipe_ingredients, quantity, real not null
recipe_ingredients, unit_of_measurement, text not null
recipe_ingredients, preparation, text
ingredient_list, ingredient_id, integer primary key
ingredient_list, ingredient_name, text not null

With this input, we get this output and the ERD diagram (example4.uml):

See in PlantUML server

Examples

CSV input format

Just 1 table

Just 1 table with multiple items

2 tables

Several tables

Examples 

CSV input format 

Just 1 table 

Just 1 table with multiple items 

2 tables 

Several tables 