
About visualization of data distributions on different sources #3

Open
pILLOW-1 opened this issue Sep 28, 2024 · 2 comments

Comments


pILLOW-1 commented Sep 28, 2024

Hi, great work!
I have two questions about the visualization of data distributions on different sources in Fig. 1.
Q1: Is the generated data visualized here learned from the corresponding data source? For example, in the first row, the data from Stable Diffusion is learned on LVIS train, right?
Q2: Based on the idea that generative data can expand the data distribution the model can learn, is it possible for a generative model trained solely on one domain to generate data from another domain (e.g., a model trained on the training set generating data similar to the testing set)?
Looking forward to your answers!

@pILLOW-1
Author

An addition to Q1: in the second row, is the data from DeepFloyd learned on LVIS val?

@leaf1170124460
Collaborator

leaf1170124460 commented Sep 30, 2024

Hi, @pILLOW-1.

Thank you for your interest in our work!

Regarding Q1: There is no "learned from the corresponding data source" relationship between the two data sources in each row. Each subplot represents the visualization of embeddings after dimension reduction of data from the respective source. For example, the LVIS train subplot shows the embeddings of all instances in LVIS train after dimension reduction, while the Stable Diffusion subplot shows the embeddings of the data generated by Stable Diffusion after dimension reduction.
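In case it helps clarify what each subplot shows, here is a minimal sketch of the "embeddings plus dimension reduction" step described above. Everything here is a hypothetical stand-in: the embedding arrays are random placeholders for per-instance features (e.g., from an image encoder), and PCA via SVD is used only as one example of dimension reduction (the actual method and embedding model used in the figure may differ).

```python
import numpy as np

def reduce_to_2d(embeddings):
    """Project (n_samples, dim) embeddings to 2-D via PCA (SVD).

    Each data source is reduced independently, mirroring the idea that
    each subplot visualizes one source's embeddings after reduction.
    """
    centered = embeddings - embeddings.mean(axis=0)
    # Top-2 right singular vectors are the principal axes.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:2].T

# Hypothetical per-instance embeddings for two sources.
rng = np.random.default_rng(0)
lvis_train_emb = rng.normal(size=(500, 512))          # e.g., LVIS train instances
stable_diffusion_emb = rng.normal(size=(500, 512))    # e.g., generated instances

pts_train = reduce_to_2d(lvis_train_emb)
pts_gen = reduce_to_2d(stable_diffusion_emb)
print(pts_train.shape, pts_gen.shape)  # (500, 2) (500, 2)
```

The resulting 2-D points for each source can then be scatter-plotted side by side, one subplot per source, as in the figure.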

Regarding Q2: In DiverGen, we did not retrain or fine-tune the pre-trained generative models. We only used the open-source pre-trained weights for data generation. However, we think your suggestion is intriguing, and we may explore it further if time allows.

Hope this clears up your confusion, and feel free to reach out if you have any more questions!
