Finding unique set from large dataset

Hello!
How would one go about finding, with MATLAB code, the unique set of variables that concisely describes which variables are used in a large dataset?
For example, if the English dictionary is my large dataset, I want the output to be the 26 letters of the alphabet, indicating that these are the unique variables used in the large dataset.
Another example
If x = {"abc", "bcd", "ded"}
I want the output as {"a","b","c","d","e"}
Thanks in advance!

Accepted Answer

madhan ravi on 18 Jul 2019
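% Split each element of x into its individual characters
a = cellfun(@(z) regexp(z, '.', 'match'), x, 'UniformOutput', false);
% Concatenate all characters and keep only the unique ones, one per cell
Output = num2cell(unique([a{:}]))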
  5 Comments
madhan ravi on 18 Jul 2019
Edited: madhan ravi on 18 Jul 2019
Thank you Guillaume :)
Sanjana Sankar on 19 Jul 2019
Thank you all. I was looking for the output from Guillaume's method. Thanks a lot!!


More Answers (2)

Waqar Ali Memon on 18 Jul 2019

Bruno Luong on 18 Jul 2019
Edited: Bruno Luong on 18 Jul 2019
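x = ["abc", "bcd", "ded"] % curly braces are not needed for strings; a string array is enough
% Concatenate all characters, keep the unique ones, and return them as a row string array
string(unique(cat(2,x{:}))')'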
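
For the dictionary example from the question, the same idea could be sketched step by step as follows. This is only an illustration, assuming x is a string array (a cell array of strings could first be converted with [x{:}]); the variable names allChars, u and Output are made up for this sketch, and the lower/isletter filtering is an added assumption so that only the letters a to z are reported.

```matlab
% Sample data from the question, stored as a string array
x = ["abc", "bcd", "ded"];

% Join all elements into one string and convert it to a char row vector
allChars = char(join(x, ""));

% Assumption for the dictionary case: ignore case and keep letters only
allChars = lower(allChars);
allChars = allChars(isletter(allChars));

% unique returns the distinct characters in sorted order
u = unique(allChars);

% Convert each character to its own string, giving a row string array
Output = string(u(:))';
```

If a cell array is preferred, as in the accepted answer, num2cell(u) should give one character per cell instead.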
