I have to read in a large binary file whose size is 92,504 KB. When I use the fread command, MATLAB gives me this error:
Error using fread Out of memory. Type HELP MEMORY for your options.
I also tried restarting MATLAB so that any virtual memory in use would be cleared, but the problem persists.
How can I solve this problem and read the data?
The problem is the code that you are using to read the data:
[data,count] = fread(fid,'uint8');
The above line tells MATLAB to read in as many uint8 values as it can and put them into a vector.
The trouble is that MATLAB will put them into a vector of doubles. So rather than a vector where each element is one byte, you have a vector where each element is eight bytes. This makes your 92 MB of data take up 92*8 = 736 MB, which is probably bigger than the maximum possible array size shown by the memory command.
The solution here is to tell MATLAB to put the data you are reading into a vector of uint8, which can be achieved as follows:
[data,count] = fread(fid,'*uint8');
This form tells MATLAB that the output vector should have the same type as the input data. You can read more about it in the precision section of the fread documentation.
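To see the difference, here is a minimal sketch (the file name test.bin is a placeholder):
fid = fopen('test.bin','r');
a = fread(fid,'uint8');     % class(a) is 'double', 8 bytes per element
frewind(fid);               % rewind to read the same bytes again
b = fread(fid,'*uint8');    % class(b) is 'uint8', 1 byte per element
fclose(fid);
whos a b                    % compare the Bytes column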
On a 32-bit system, you may have very little memory available to MATLAB. The fread command you are using reads the entire file at once. This is probably a bad idea, since your system does not have enough memory. A better way is to read the file part by part. See
A = fread(fileID, sizeA)
in the link below [1]. You can put this call inside a loop, as sketched below. If you want to read the whole file at once, I would recommend using a 64-bit system with at least 3 GB of RAM.
[1] http://www.mathworks.in/help/matlab/ref/fread.html
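A rough sketch of such a loop (the file name and chunk size are assumptions, adjust them to your data):
fid = fopen('bigfile.bin','r');
chunkSize = 1e6;                            % number of bytes per read
while ~feof(fid)
    chunk = fread(fid, chunkSize, '*uint8');
    % ... process the chunk here, then let it go out of scope ...
end
fclose(fid);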
I rewrote my bottleneck function as a parallelized MEX function with MATLAB Coder and it works fine so far.
But at certain points the function crashes with an "Unexpected unknown exception from MEX file" error.
I marked some points in the function to display letters like 'a' so that I can see where in the MEX function the error happens, since I can't set breakpoints inside it anymore. It didn't even reach the first letter, so the error must happen while initializing the function.
I recognized that the error always happens when the input variable size exceeds a certain limit.
The main input variables, alongside some 1x1 double values, are three nx1 double arrays and one nxn logical matrix. n can vary in size and will change during the iterations, so I declared them as :Inf x 1 and :Inf x :Inf in Coder.
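Roughly, such a declaration can be written with coder.typeof like this (the function name myBottleneck and the argument order are placeholders):
s = coder.typeof(0);                        % 1x1 double scalar
v = coder.typeof(0, [Inf 1], [1 0]);        % :Inf x 1 double array
M = coder.typeof(false, [Inf Inf], [1 1]);  % :Inf x :Inf logical matrix
codegen myBottleneck -args {s, v, v, v, M}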
The problem appears when n exceeds a value of about 45000. I don't know the exact value, since in one iteration, when it is below the threshold, everything is fine, and in the next one, when it is above, it crashes.
When analysing it further, I saw that it seems to happen when the nxn matrix exceeds a size of 2^31 bytes. (A logical element occupies one byte, so an nxn logical matrix passes 2^31 bytes at n = sqrt(2^31) ≈ 46341, which matches the threshold of about 45000 that I observed.)
When I try to run the code generation with a matrix of the size that fails in the MEX function, I get an error along the lines of "error with intmax()... not supported".
So I guess that Coder writes MEX functions with 32-bit indexing, and when loading a matrix larger than that, no index can address the right element in the matrix?
How do I bypass that problem?
Since the matrix is a boolean matrix, I have now found out that MATLAB supports sparse matrices, and that this would save a lot of memory, since only about 1/10^4 of my elements are 1. But I would still like to know whether the problem above results from 32-bit MEX functions and, if so, how to solve it.
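As a quick check of the potential saving (the size here is just an illustration, chosen small enough that the dense matrix can also be built):
n = 20000;
S = sprand(n, n, 1e-4) > 0;   % sparse logical with ~1/10^4 nonzeros
F = full(S);                  % the dense logical equivalent
whos S F                      % S needs a few MB, F needs n^2 bytes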
Yes, this issue is caused by the MEX-file being limited to 32-bit indexing. This is a limit in Coder:
For variable-size arrays that use dynamic memory allocation, the maximum number of elements is the smaller of:
- intmax('int32').
- The largest power of 2 that fits in the C int data type on the target hardware.
These restrictions apply even on a 64-bit platform.
See the MATLAB Coder documentation on restrictions for variable-size data.
As far as I can tell, the only workaround is to have Coder generate C or C++ code for you, and to manually edit that code to use size_t instead of int for indexing (and of course remove any checks on array size). I have never worked with code generated by Coder, so I am not sure how hard this is. I have written many MEX-files in C and C++, though, and those MEX-files work perfectly fine with 64-bit indexing and arrays larger than 2 GB.
I am getting readings from an accelerometer connected to an Arduino, which is in turn connected to MATLAB over a serial link. I would like to write the readings to a text file. A 10 second reading produces around 1000 entries, which makes the text file around 1 KB in size.
I will be using the following code:
%%%%%// Communication %%%%%
arduino = serial('COM6','BaudRate',9600);   % serial link to the Arduino
fopen(arduino);
fileID = fopen('Readings.txt','w');         % output text file

%%%%%// Reading from Serial %%%%%
vib = [];                                   % accumulated readings
for i = 1:Samples                           % Samples = number of readings to take
    scan = fscanf(arduino,'%f');
    if isfloat(scan)
        vib = [vib; scan];
        fprintf(fileID,'%0.3f\r\n',scan);
    end
end

fclose(fileID);                             % close the file and port when done
fclose(arduino);
Any suggestions on improving this code? Will it hit a time or size limit? This code is to be run for 3 days.
Do not use text files, use binary files. A number like 42718123229.123123 takes 18 bytes as ASCII text but only 4 bytes in a binary file as a single precision value. Don't waste space unnecessarily. If your data is going to be used later in MATLAB, then I simply suggest you save it in .mat files.
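A quick sketch of the difference (file names are placeholders):
x = rand(1000,1) * 1e10;          % some example readings

fid = fopen('readings.bin','w');
fwrite(fid, x, 'single');         % 4 bytes per value
fclose(fid);

fid = fopen('readings.txt','w');
fprintf(fid, '%0.6f\r\n', x);     % roughly 18 bytes per value as text
fclose(fid);

save('readings.mat', 'x');        % or simply keep it in a .mat file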
Do not use a single file! Choose a reasonable file size (e.g. 100 MB) and make sure that when you reach that much data you switch to another file. You could do this by e.g. saving one file per hour, as sketched below. This way you minimize the damage if the software crashes 2 minutes before finishing.
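A rough sketch of such a rotation (the name pattern and the acquiring flag are assumptions):
currentHour = '';
fileID = -1;
while acquiring                              % your acquisition condition
    h = datestr(now, 'yyyymmdd_HH');
    if ~strcmp(h, currentHour)               % hour changed: switch files
        if fileID > 0, fclose(fileID); end
        fileID = fopen(['Readings_' h '.txt'], 'w');
        currentHour = h;
    end
    % ... read one sample and fprintf it to fileID ...
end
if fileID > 0, fclose(fileID); end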
Now, knowing the real dimensions of your problem, writing a text file is totally fine; nothing special is required to process such small data. But there is a problem with your code. You are growing the variable vib over time. That may cause bad performance, because you are not preallocating, and it may consume a lot of memory. I strongly recommend not keeping this variable; if you need the data, read it back from the file afterwards.
Another thing you should consider is verification of your data. What do you do when you receive fewer samples than you expect? Include timestamps! Be aware that these timestamps are not precise, because you add them after reception, but they let you identify whether just some random samples are missing (these may be interpolated afterwards) or a consecutive series of maybe 100 samples is missing.
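Putting both points together, the loop could look like this (Samples and the open arduino/fileID handles are as in the original code):
for i = 1:Samples
    scan = fscanf(arduino, '%f');
    if isfloat(scan)
        % timestamp added on the PC side, so it is only approximate
        fprintf(fileID, '%s,%0.3f\r\n', datestr(now,'HH:MM:SS.FFF'), scan);
    end
end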
I have csv files of size 6 GB and I tried using the import function in MATLAB to load them, but it failed due to a memory issue. Is there a way to reduce the size of the files?
I think the number of columns is causing the problem. I have 133076 rows by 2329 columns. I had another file with the same number of rows but only 12 columns, and MATLAB could handle that. However, once the number of columns increases, the files get really big.
Ultimately, if I can read the data column-wise, so that I get 2329 column vectors of length 133076, that would be great.
I am using MATLAB R2014a.
Numeric data are by default stored by Matlab in double precision format, which takes up 8 bytes per number. Data of size 133076 x 2329 therefore take up 2.3 GiB in memory. Do you have that much free memory? If not, reducing the file size won't help.
If the problem is not that the data themselves don't fit into memory, but is really about the process of reading such a large csv-file, then maybe using the syntax
M = csvread(filename,R1,C1,[R1 C1 R2 C2])
might help, since it allows you to read only part of the data at a time. Read the data in chunks and assemble them in a (preallocated!) array; a sketch follows at the end of this answer.
If you do not have enough memory, another possibility is to read chunkwise and then convert each chunk to single precision before storing it. This reduces memory consumption by a factor of two.
And finally, if you don't process the data all at once, but can implement your algorithm such that it uses only a few rows or columns at a time, that same syntax may help you to avoid having all the data in memory at the same time.
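Putting the chunked reading and the single conversion together, a rough sketch (the file name and chunk size are assumptions):
filename = 'data.csv';
nRows = 133076;  nCols = 2329;
chunk = 10000;                                % rows per chunk
M = zeros(nRows, nCols, 'single');            % preallocated, single precision
for r0 = 0:chunk:nRows-1                      % csvread indices are 0-based
    r1 = min(r0 + chunk - 1, nRows - 1);
    M(r0+1:r1+1, :) = single(csvread(filename, r0, 0, [r0 0 r1 nCols-1]));
end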
I want to save an array efficiently in MATLAB. I have an array of size 3000 by 9000. If I save this array in a mat file, it consumes around 214 MB using just the save function. If I use fwrite with the float data type, it comes to around 112 MB. Is there any other way to reduce the hard disk space consumed when I save this array in MATLAB?
I would suggest writing the data in binary mode and then applying a compression algorithm such as bzip2.
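MATLAB has gzip built in (bzip2 itself is not), so as an illustration of the idea (the file name is a placeholder):
A = rand(3000, 9000);        % example array
fid = fopen('A.bin', 'w');
fwrite(fid, A, 'single');    % 4 bytes per element instead of 8
fclose(fid);
gzip('A.bin');               % writes the compressed file A.bin.gz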
There are a few ways to reduce the required disk space:
1. Reducing precision
Rather than the double you normally have, consider using a single, or perhaps even a uint8 or logical. Writing with fwrite, as you did with the float precision, achieves the same reduction, but you may want to compress the result further afterwards, since fwrite does not produce a compressed file.
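A minimal sketch of the precision route (note that the default .mat format, v7, is itself compressed):
A = rand(3000, 9000);        % example data
As = single(A);              % 4 bytes per element instead of 8
save('A_single.mat', 'As');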
2. Utilizing a pattern
If your matrix has a certain pattern, this can sometimes be leveraged to store the data more efficiently, or at least the information needed to recreate the data. The most common example is a matrix that is storable as a few vectors, for example when it is sparse.
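A minimal sketch of the sparse route (the fill pattern here is made up):
A = zeros(3000, 9000);
A(randi(numel(A), 1000, 1)) = 1;   % hypothetical: only ~1000 nonzeros
S = sparse(A);
save('A_sparse.mat', 'S');         % stores only the nonzeros plus their indices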
As in my previous question, I have the following problem. I have an nxn cell array P whose elements P{i,j} are also nxn matrices, so the total number of elements is n^4. For n = 100 I get an error about lack of memory. I compute this array only once and then operate with it. Could you advise me how to store the matrices P{i,j} on the HDD?
I mean that maybe it is possible to store each of them in a file like "data_i_j.dat" and then load them while doing computations in a loop over i and j?
The save function will write data to a file, and the load function will read it back again. Use save(filename,varname,varname,...), followed by S = load(filename), and refer to S.varname (there is also a version of load that just dumps the variables into your current workspace, but that seems like poor practice).
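A rough sketch of the per-block scheme from the question (using .mat files and the suggested data_i_j naming):
n = 100;
for i = 1:n
    for j = 1:n
        Pij = rand(n);                             % compute block (i,j) here
        save(sprintf('data_%d_%d.mat', i, j), 'Pij');
    end
end
% later, load one block at a time instead of holding all n^4 elements:
S = load(sprintf('data_%d_%d.mat', 3, 7));
block = S.Pij;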