How can I extract data from a large ASCII file?

2 views (last 30 days)

anton fernando on 20 May 2014

0
Link

Direct link to this question

https://www.mathworks.com/matlabcentral/answers/130263-how-can-i-extract-data-from-a-large-ascii-file

Edited: Cedric on 20 May 2014

I have a ASCII data file with unknown number of columns and rows. In the file there are some unwanted text lines on top. I want to read only some of the columns in the data set with the header by removing the text lines on top. I appreciate if anyone can help.

0 Comments
Show -2 older commentsHide -2 older comments

Accepted Answer

Cedric on 20 May 2014

1
Link

Direct link to this answer

https://www.mathworks.com/matlabcentral/answers/130263-how-can-i-extract-data-from-a-large-ascii-file#answer_137508

Open in MATLAB Online

>> doc textscan

and use the parameter HeaderLines to skip the header. Supposing that you have 7 header lines and that you need columns 1 and 3 (discarding the rest of each line), you should have something like:

 content = fileread( 'myData.txt' ) ;
 data    = textscan( content, '%f%*f%f%*[^\n]', 'HeaderLines', 7, ...
                     'CollectOutput', true ) ;
 data    = data{1} ;

where you see a * in the formatSpec argument to skip column 2 and %*[^\n] at the end to skip the rest of the line.

14 Comments
Show 12 older commentsHide 12 older comments

Cedric on 20 May 2014

Edited: Cedric on 20 May 2014

Open in MATLAB Online

Yep, that's it. You probably updated the example to

 data = textscan( content, '%*f%*f%*f%f%*f%f%*[^\n]', 'HeaderLines', 12, ...
                  'CollectOutput', true ) ;
 data = data{1} ;

Now you can see your data e.g. per row, and you'll see that they are non-zero:

 >> data(50,:)
 ans =
   1.0e-03 *
    0.5940    0.0073

Optionally, if you really wanted to display the whole array and see non-zero entries, you could type

>> format long

Then ..

 >> data
 data =
0e+02 *
009380000000000  -9.990000000000000
008250000000000  -9.990000000000000
007240000000000  -9.990000000000000
006350000000000  -9.990000000000000
005550000000000  -9.990000000000000
004840000000000  -9.990000000000000
004200000000000  -9.990000000000000
003630000000000  -9.990000000000000
003130000000000  -9.990000000000000
002670000000000   0.000000614000000
002280000000000   0.000000292000000
001950000000000   0.000000110000000
001680000000000   0.000000076300000
001440000000000   0.000000035600000
001230000000000   0.000000047500000
001060000000000   0.000000042100000
000906000000000   0.000000042300000
000775000000000   0.000000042200000
000664000000000   0.000000043400000
000567000000000   0.000000045600000
000484000000000   0.000000047000000
000413000000000   0.000000048500000
000353000000000   0.000000049900000
000301000000000   0.000000051300000
000256000000000   0.000000052600000
   ...

but again, the fact that you displayed a lot of zeros initially is just a display artifact (truncation), so there is no need to set the display format to long if all you need is to compute with these numbers afterwards.

anton fernando on 20 May 2014

Thank you. I really appreciate your help.

Cedric on 20 May 2014

My pleasure!

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

How can I extract data from a large ASCII file?

0 Comments
Show -2 older commentsHide -2 older comments

Accepted Answer

14 Comments
Show 12 older commentsHide 12 older comments

More Answers (0)

See Also

Categories

Tags

Community Treasure Hunt

How can I extract data from a large ASCII file?

0 Comments Show -2 older commentsHide -2 older comments

Accepted Answer

14 Comments Show 12 older commentsHide 12 older comments

More Answers (0)

See Also

Categories

Tags

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

14 Comments
Show 12 older commentsHide 12 older comments