Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for categorical columns #33

Open
juliohm opened this issue Nov 30, 2023 · 5 comments
Open

Add support for categorical columns #33

juliohm opened this issue Nov 30, 2023 · 5 comments
Labels
enhancement New feature or request

Comments

@juliohm
Copy link
Collaborator

juliohm commented Nov 30, 2023

It would be nice to have some support for categorical columns (e.g. columns of strings).

We are building a stack on top of DataScienceTraits.jl to recognize the different scientific types of columns and dispatch different behavior accordingly: https://github.com/JuliaML/DataScienceTraits.jl

Would it be a good idea to add the package as a dependency, and then adjust the plots whenever a column is Categorical?

Notice that the package is lightweight, and all specializations live in package extensions.

@juliohm juliohm added the enhancement New feature or request label Nov 30, 2023
@sefffal
Copy link
Owner

sefffal commented Dec 1, 2023

Interesting. Currently the visualizations layers are chosen based on their position (diagonal or body) but I could see this being extended to different visualizations per pair of variables. It might need some restructuring.

How would you expect categorical plots to look?

@juliohm
Copy link
Collaborator Author

juliohm commented Dec 1, 2023

I would expect categorical variables to show as bar plots in the diagonal and some other custom visualization in the off-diagonal plots. We could also imagine a situation where users just want to use categorical columns to color the plots of continuous variables.

@juliohm
Copy link
Collaborator Author

juliohm commented Apr 26, 2024

@sefffal do you have any idea of how much work is required to refactor PairPlots.jl to include such features? It would be amazing to use the package in more advanced data science contexts.

We are using the package in university short-courses, and students always love it.

@sefffal
Copy link
Owner

sefffal commented Apr 27, 2024

Appreciate the bump @juliohm . I am still interested in adding this feature but it's hard for me to justify prioritizing it vs my own research at the present moment.

Sorry for the disappointing answer but it's better to set realistic expectations.

If you have eg a student willing to work on this I could support them in making the necessary changes.

@juliohm
Copy link
Collaborator Author

juliohm commented Apr 27, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants