Spotfire Ideas Portal
Status: Future Consideration
Product: Spotfire
Categories: Custom Expressions
Created by Guest
Created on May 24, 2019

Random Sample data as Transformation option

I don't think this is currently possible. To take a random sample today, I create a calculated column with a random value, e.g. Rand(1). Then I filter the data down to my sample size (e.g. I filter out 0.0 to 0.7 if I want a random sample that's 30% of the original). Then I have to export the data and re-import it into the analysis as a new data set. I would prefer a simple "random sampling" option as a transformation. Maybe it could offer different sampling methods (e.g. stratified sampling) too.
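
To make the logic of the workaround above concrete, here is a minimal sketch in plain Python. It is not Spotfire code: `sample_by_threshold` is a hypothetical name, and Python's `random.Random` only stands in for the Rand(1) calculated column. Each row gets a uniform random value, and rows whose value falls in the filtered-out range are dropped.

```python
import random


def sample_by_threshold(rows, fraction, seed=1):
    """Keep each row independently with probability `fraction`.

    Hypothetical helper for illustration only; it mimics the
    "random column + filter" workaround, not any Spotfire API.
    """
    rng = random.Random(seed)  # fixed seed so the sample is repeatable
    cutoff = 1.0 - fraction    # e.g. 0.7 for a 30% sample
    kept = []
    for row in rows:
        r = rng.random()       # stands in for the random calculated column
        if r >= cutoff:        # i.e. filter out values from 0.0 up to the cutoff
            kept.append(row)
    return kept


rows = list(range(10_000))
sample = sample_by_threshold(rows, fraction=0.3)
print(len(sample))  # close to 3000, but not exactly 3000
```

Note that this kind of threshold filter gives a sample that is only approximately 30% of the rows, since each row is kept or dropped independently.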

  • Guest
    Jan 16, 2020

    It would be useful to have other sampling methods such as stratified, cluster, and multistage. These might be better suited as Expression functions since there are input variables... not sure.

    If used as a Transformation, however, I imagine the UI could be similar to how the Normalization Transformation works today. Since filtering is required for this, it might have to be a Transformation and not an Expression.

  • Guest
    Jan 16, 2020

    Hey Thomas, I made a video on the row filter method: https://youtu.be/pL0p0ivjBH8

    Would still love to have this simpler though, if it's an easy enhancement. I use this often.

  • Guest
    Jun 3, 2019

    @Thomas - Yes, I have tried it with those two features and it works. I was looking for a simpler way, if possible.

    This would be useful for model development, but potentially also in dashboard design (e.g. design the dashboard with a smaller data set during development, then delete the transformation before publishing to production).

  • Guest
    Jun 3, 2019

    Have you tried the rand method in combination with a row filter transformation?

  • Guest
    May 28, 2019

    Great idea and very valuable in model development!
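
For the stratified sampling mentioned in this thread, a rough sketch in plain Python might look like the following. Again, this is not Spotfire functionality: `stratified_sample` and its `key` parameter are hypothetical, and the idea is simply to draw the same fraction from each group so that group proportions are preserved in the sample.

```python
import random
from collections import defaultdict


def stratified_sample(rows, key, fraction, seed=1):
    """Draw roughly `fraction` of the rows from each stratum.

    Hypothetical helper for illustration only. `key` is a function
    that returns the stratum (group) a row belongs to.
    """
    rng = random.Random(seed)
    strata = defaultdict(list)
    for row in rows:
        strata[key(row)].append(row)  # group rows by stratum

    sample = []
    for group in strata.values():
        k = round(len(group) * fraction)     # rows to draw from this stratum
        sample.extend(rng.sample(group, k))  # sample without replacement
    return sample


rows = [("A", i) for i in range(100)] + [("B", i) for i in range(50)]
sample = stratified_sample(rows, key=lambda row: row[0], fraction=0.3)
print(len(sample))
```

Unlike the random-threshold filter, this draws a fixed number of rows per group, so very small groups can round down to zero rows. Whether that is acceptable depends on the use case.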