What is synthetic data? I get asked this often in my line of work (data science). It is best to unpack the assumptions we make about data. We humans and all other living beings are analog creatures who experience a continuous stream of sensory experience in the form of sight, sound, taste, touch, and smell…. Read more →
Blog
Fintech Product Manager
This may be a predictor of where the importance of synthetic data is headed in the Fortune 500. Capital One recently posted this recruitment ad.
TechCrunch
Synthetic Image Generation is one aspect of digital synthetic data generation. There is a lot of interesting research in this area. See the article below from Evan Nisselson on how synthetic data is leveling the playing field for startups. https://techcrunch.com/2018/05/11/deep-learning-with-synthetic-data-will-democratize-the-tech-industry/
Is It Artificial?
MIT has released information about a new study that shows real and and artificial data can be used equally to generate similar results. The paper describes the Synthetic Data Vault (SDV), a system that builds machine learning models out of real databases in order to create artificial, or synthetic, data. You can learn more about… Read more →
New Thinking
Great leaps forward in scientific discoveries are often accompanied by certain risk and unintended consequences. Pushing the boundaries of AI using synthetic data is no different. AI innovations are emerging from research and development labs staffed by computer scientists, data scientists, cognitive scientists, mathematicians, statisticians, physicists, and other applied researchers working in a broad cross-section… Read more →
New Vocabulary
It is always easier to deal with one revolution at a time in technology – relational databases in the 80s, client/server networks in the 90s, and the Internet in the 2000s, for example. AI is a different kind of revolution since it touches all types of technology – hardware, software, networks, telecommunications, and mobile devices…. Read more →
Modeling Data
The theory and practice of data modeling has been with us for more than 30 years. You can learn more about the role of Dr. Peter Chen and the history of Entity Relationship diagrams and data modeling here – http://csc.lsu.edu/~chen/. At Synthetic IO we take a different approach – we like to model with data. … Read more →
IEEE Top 50
Use cases for synthetic data generation come in all shapes and sizes. We have compiled a list of the top 50 article abstracts from IEEE that reference use cases related to synthetic data. Please find the summaries below. 1. Statistical Methods for Generating SYNTHETIC Email Data Sets In: 2018 IEEE International Conference on Big Data… Read more →