Fake data generator python8/10/2023 ![]() ![]() If it’s technical waffle you want, here’s a classic pure fake computer science paper generator, SciGen is a classic, though it may take a bit of digging to find the required dependencies to run it… ![]() For example, loremipsum or collective.loremipsum. Searching for “sentence generator” will also turn up some handy packages…: markovify or markovipy, for example. If you prefer using neural network models, there are those too: textgenrnn. If you like waffle, here’s a WaffleGenerator. For example, you can generate test datasets using SciKit Learn and here’s one of my early attempts at generating 2D numerical data to demonstrate different correlation coefficients.įor text strings, generators are often referred to as ‘lorem ipsum’ generators ( why?). You can also generate numerical data with various statistical properties. (We’re about to start looking at producing a new machine learning course, so stumbling across that sort of possible requirement is quite timely…)īy chance, whilst searching for something else, I spotted this article describing pydbgen, a simple Python package for generating fake data tables to test simple database systems.Ī quick trawl turns up other packages for doing similar things, such as mimesis, or faker, which also inspired this more general R package, charlatan. It also means you can test as much as you want without having to expose any real data.Īccording to this article - Synthetic data generation - a must-have skill for new data scientists - knowing how to create effective test data is one of those new skills folk are going to have to learn. When developing or testing a data system, it often makes sense to try it out with some data that looks real, but isn’t real, just in case something goes wrong… ![]()
0 Comments
Leave a Reply.AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |