Help with extracting part of a string

25 views (last 30 days)
I have the string "2DVIS_data_08_40fs", and I want to extract the numbers included between the last underscore '_' and the last two letters 'fs', in this case 40. Given the string "2DVIS_data_08_120fs", I want as output 120 and so on. How can I do this? thanks

Accepted Answer

Chad Greene
Chad Greene on 26 Sep 2014
Guillame's answer is best, and it's probably good to learn how to use the power of regexp. But if you want a more intuitive-for-beginners approach, find the indices of the underscores, find the indices of fs, and then return everything between them:
somestring = '2DVIS_data_08_40fs';
underscore_indices = strfind(somestring,'_');
fs_indices = strfind(somestring,'fs');
yourNumber = str2double(somestring(underscore_indices(end)+1:fs_indices(end)-1))
  1 Comment
aurc89
aurc89 on 28 Sep 2014
Thank you, since I'm a beginner this is easier for me

Sign in to comment.

More Answers (3)

Stephen23
Stephen23 on 26 Sep 2014
Edited: Stephen23 on 27 Sep 2014
Use regexp .
Given a literal interpretation of your statement "extract the numbers included between the last underscore '_' and the last two letters 'fs'", this could be done using the following regexp match string:
>>A = {'2DVIS_data_08_120fs','2DVIS_data_08_40fs'};
>>B = regexp(A,'(?<=_)\d+(?=fs$)','once','match')
B =
'120' '40'
This match string locates one or more digits located between an '_' and the last two characters of the string (which must be 'fs'). If the strings are always of the same format, then matching of the last two characters would be enough.

Guillaume
Guillaume on 26 Sep 2014
Use regexp, you haven't explained the pattern in great details, the following may be what you want:
elems = regexp(s, '_(\d+)_[^_]*(..)$', 'tokens', 'once');
number = str2double(elems{1});
string = elems{2};
It will match any pattern with _, followed by 1 or more digit (an integer), followed by _, followed by any 0 or more characters not including _, followed by two characters at the end of the string. It extracts the integer and the last two characters.
  2 Comments
Guillaume
Guillaume on 27 Sep 2014
Realised I misunderstood the pattern. Stephen's or the following would work:
number = str2double(regexp(s, '_(\d+)..$', 'tokens', 'once'));
If you want to ensure that the last two characters are letters:
number = str2double(regexp(s, '_(\d+)[a-zA-Z]{2}', 'tokens', 'once'));
Or if it's always 'fs':
number = str2double(regexp(s, '_(\d+)fs', 'tokens', 'once'));
Basically, learn regexp.

Sign in to comment.


Albert Passy
Albert Passy on 14 Nov 2018
"Some people, when confronted with a problem, think "I know, I'll use regular expressions." Now they have two problems."
Regexes are expressive, compact and cryptic. reserve them for complex problems.
  1 Comment
Stephen23
Stephen23 on 14 Nov 2018
Edited: Stephen23 on 14 Nov 2018
Hmmm... but if we did not use them for simple problems, then we would not know how to use them properly for complex problems.

Sign in to comment.

Categories

Find more on Characters and Strings in Help Center and File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!