Performance analysis fetching data with PDO and PHP.

March 28, 2011March 28, 2011 ~ Gonzalo Ayuso

Fetching data from databases is a common operation in our work as developers. There are many drivers (normally I use PDO), but the usage of all of them are similar and switch from one to another is not difficult (they almost share the same interface). In this post I will focus on fetching data. Basically we’ve got two functions: fetch and fetchAll. I’ve created two examples. One with fetch and another one with fetchAll:

// Example with fetch
error_reporting(-1);
$time = microtime(TRUE);
$mem = memory_get_usage();

$dbh = new PDO('pgsql:dbname=mydb;host=localhost', 'username', 'password');
$dbh->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);

$stmt = $dbh->prepare('SELECT * FROM tableName limit 10000');
$stmt->execute();

$i=0;
while ($row = $stmt->fetch()) {
	$i++;
}
echo '
<h1>fetch()</h1>
';
echo '
<strong>{$i} </strong>

';
print_r(array('memory' => (memory_get_usage() - $mem) / (1024 * 1024), 'seconds' => microtime(TRUE) - $time));

// Example with fetchAll
error_reporting(-1);
$time = microtime(TRUE);
$mem = memory_get_usage();

$dbh = new PDO('pgsql:dbname=mydb;host=localhost', 'username', 'password');
$dbh->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);

$stmt = $dbh->prepare('SELECT * FROM tableName limit 10000');
$stmt->execute();

$i=0;
$data = $stmt->fetchAll();
foreach ($data as $row) {
	$i++;
}

echo '
<h1>fetchAll()</h1>
';
echo '
<strong>{$i}</strong>

';
print_r(array('memory' => (memory_get_usage() - $mem) / (1024 * 1024), 'seconds' => microtime(TRUE) - $time));

if we execute the test we obtain:

fetchAll: [memory] => 31.305999755859
fetch: [memory] => 0.002532958984375

OK. It’s obvious. If we approach to the data extraction with fetchAll method we will use more memory. That’s because we’re mapping the whole recorded to a variable ($data) at once. With the fetch loop we are mapping only on row per iteration. By the way if we change the fetch loop to:

$data = array();
while ($row = $stmt->fetch()) {
	$i++;
	$data[] = $row;
}

We will use almost the same amount of memory than the fetchAll method
[memory] => 31.267543792725

Conclusion:
Is it better fetch than fetchAll? The answer is simple: No. We only need to take care what are we doing and use the best solution that fix to our need. If we’re handling small recordset, they’re similar, but if we work with big ones we need to realize that the memory usage we are using changes drastically if we use one method or another.

Published by Gonzalo Ayuso

Web Architect. PHP, Python, Node, Angular, ionic, PostgreSQL, Linux, ... Always learning. View all posts by Gonzalo Ayuso

17 thoughts on “Performance analysis fetching data with PDO and PHP.”

Ricardo Machado says:

March 28, 2011 at 1:57 pm

Hi Gonzalo,
First of all, nice post. Although for advanced developers it’s obvious, for others less-advanced it might not be. It is however entirely logical.

Still, I would love to see an article about performance comparisons between the use of objects and arrays (associative or not) for data manipulation.
IMHO, OOP is excellent because allows us to develop (web) applications more maintainable, portable and easily re-usable. However, performance it’s not exactly one of it’s benefits.

Arrays are, in a more natural and native way, a more faster and less resource-needed way to store data in memory and to manipulate it. Not so easy as the object-oriented way.

Hugz.

Reply
1. Gonzalo Ayuso says:
  
  March 28, 2011 at 4:31 pm
  
  txs. In fact I wrote this article because I’ve seen a lot of times fetchAll instead of fetch within projects without any reason. If we need to use it, it’s perfect but we always must bear in mind the significant memory usage differences. Probably the outcome is too obvious. BTW, I write your suggestion 😉
  
  Reply
Marcus Dalgren says:

March 28, 2011 at 2:15 pm

Are there any speed differences between the two methods?

Reply
1. Gonzalo Ayuso says:
  
  March 28, 2011 at 2:34 pm
  
  I’ve omitted speed because it’s almost the same. The main important difference is the memory usage, and I’ve focussed on it
  
  Reply
Pingback: Database abstraction layers in PHP. PDO versus DBAL « Gonzalo Ayuso | Web Architect
Tommi says:

December 29, 2011 at 10:41 am

Actually according to my tests speed difference is significant when using bigger data sets. My current problem is that fetchAll drains the memory and fetch is taking ages compared looping fetched all array.

Dev
http://www.epanorama.net

Reply
1. Gonzalo Ayuso says:
  
  December 31, 2011 at 4:14 pm
  
  The difference between both methods is only visible with big datasets, indeed. fetchAll is more comfortable (at least for me) but we need to take into account the big memory usage with big datasets.
  
  Do you flush output to the browser? Another possible problem is the Select statement. (if you use sub-queries within each the performance will be penalized). Another thing may be the latency between web-server and database server.
  
  Reply
pkr says:

March 19, 2012 at 4:13 pm

Hi Gonzalo,

I found that fetch() performs faster that fetchAll() with large set of data.With 1K of data fetch() took .1sec to create form while fetchAll took .8 sec.Can you please let me know the reason.

Reply
1. Gonzalo Ayuso says:
  
  May 15, 2012 at 2:53 pm
  
  I wrote this post to explain this issue. Read the second paragraph. It’s problably because the memory that PHP need to create the variable.
  
  Reply
Eric says:

April 10, 2014 at 1:27 pm

Hi, nice shot, but I want to implement it on PDOStatement::execute(). How we can implement this “telemetry” on that? I tried to use __call(), but it don’t works with public methods.

So, any ideas?

Hugs

Reply
1. Gonzalo Ayuso says:
  
  April 13, 2014 at 3:18 pm
  
  I don’t understand your question. Can you show me an example?
  
  Reply
  1. Eric says:
    
    April 14, 2014 at 2:44 pm
    My idea is implement your techniques to monitor every single call for PDOStatement::execute();
    
    This way, we can monitor EVERY single call to this method automaticaly. So, we don’t need to write your code for every call to PDOStatement::execute(). Get it?
    
    My initial idea was to use “magic methods”, but unfortunately PHP don’t have one magic method what intercepts every single call.
    
    http://www.php.net/manual/es/language.oop5.magic.php
    
    What I wanna do is something like:
    info = array(‘memory’ => (memory_get_usage() – $mem) / (1024 * 1024), ‘seconds’ => microtime(TRUE) – $time);
    
    } else {
    
    // execute \PDOStatement::__call() for another methods
    parent::__call($name, $arguments);
    
    }
    }
    }
    
    // with this class implementation, it would be possible:
    
    $pdo = new \PDO($dsn, $user, $password);
    
    // it need to return MyPDOStatement instead PDOStatement
    $sth = $pdo->prepare(‘my big heavy SQL here’);
    $sth->execute(array(‘param’ => ‘value’));
    
    // here we get our memory and time info
    echo ‘
    
    . 'print_r($sth->info, true) . '
    
    ;
    
    ?>
    
    tks.
Eric says:

April 14, 2014 at 8:40 pm

Sorry, my last post exploded…

What I meant was something like it:
http://www.coderholic.com/php-database-query-logging-with-pdo/

My problem was, how to extend PDO and PDOStatement classes to include your functionalities.

Please delete last post 😉

Reply
Jc Tan says:

June 26, 2014 at 9:29 am

nice! thanks for the information 😀

Reply
HellsAn631 says:

June 28, 2014 at 10:05 am

I don’t understand the implications of this post. It seems kind of obvious, and its not the fact that fetchAll uses more memory that is the key thing here.

What fetchAll is doing is loading all of the values (10,000) to a single array in memory.

(total variables = 10,000)

With the regular fetch() statement, your loading each fetch into the same variable. At the end of the application, you are left with only the one variable, the last fetch statement.

(total variables = 1)

Basically this entire article can be summed up to “fetch only loads one variable, and fetchAll() loads all the data. More Data = More Memory.”

Reply
1. Gonzalo Ayuso says:
  
  June 29, 2014 at 12:07 pm
  
  Yes. When I wrote this post I wanted to measure it. I wanted to show with numbers the big difference between fetch and fetchAll (especially with big datasets)
  
  Reply
Pingback: fetch() กับ fetchALL() ใช้อะไรดี robruu_online #3 – lagman's Blog