~percona-toolkit-dev/percona-toolkit/1.0


Viewing changes to docs/user/pt-visual-explain.rst

Committer: Daniel Nichter
Date: 2011-07-14 19:08:47 UTC
Revision ID: daniel@percona.com-20110714190847-lggalkuvdrh7c4jp

Add standard pkg files (COPYING, README, etc.), percona-toolkit.pod, and user docs. Remove dev/docs/html.

files added:
COPYING

Changelog

INSTALL

Makefile.PL

README

config/sphinx-build

config/sphinx-build/_static

config/sphinx-build/_templates

config/sphinx-build/conf.py

docs/percona-toolkit.pod

docs/user/index.rst

docs/user/pt-align.rst

docs/user/pt-archiver.rst

docs/user/pt-checksum-filter.rst

docs/user/pt-collect.rst

docs/user/pt-config-diff.rst

docs/user/pt-deadlock-logger.rst

docs/user/pt-diskstats.rst

docs/user/pt-duplicate-key-checker.rst

docs/user/pt-fifo-split.rst

docs/user/pt-find.rst

docs/user/pt-fk-error-logger.rst

docs/user/pt-heartbeat.rst

docs/user/pt-index-usage.rst

docs/user/pt-kill.rst

docs/user/pt-log-player.rst

docs/user/pt-mext.rst

docs/user/pt-mysql-summary.rst

docs/user/pt-online-schema-change.rst

docs/user/pt-pmp.rst

docs/user/pt-profile-compact.rst

docs/user/pt-query-advisor.rst

docs/user/pt-query-digest.rst

docs/user/pt-query-profiler.rst

docs/user/pt-rel.rst

docs/user/pt-show-grants.rst

docs/user/pt-sift.rst

docs/user/pt-slave-delay.rst

docs/user/pt-slave-find.rst

docs/user/pt-slave-restart.rst

docs/user/pt-stalk.rst

docs/user/pt-summary.rst

docs/user/pt-table-checksum.rst

docs/user/pt-table-sync.rst

docs/user/pt-tcp-model.rst

docs/user/pt-trend.rst

docs/user/pt-upgrade.rst

docs/user/pt-usl.rst

docs/user/pt-variable-advisor.rst

docs/user/pt-visual-explain.rst

docs/user/tools.rst

files removed:
docs/dev/html

docs/dev/html/files

docs/dev/html/files/modules

docs/dev/html/files/modules/Advisor-pm.html

docs/dev/html/files/modules/AdvisorRules-pm.html

docs/dev/html/files/modules/BinaryLogParser-pm.html

docs/dev/html/files/modules/ChangeHandler-pm.html

docs/dev/html/files/modules/CompareQueryTimes-pm.html

docs/dev/html/files/modules/CompareResults-pm.html

docs/dev/html/files/modules/CompareTableStructs-pm.html

docs/dev/html/files/modules/CompareWarnings-pm.html

docs/dev/html/files/modules/CopyRowsInsertSelect-pm.html

docs/dev/html/files/modules/DSNParser-pm.html

docs/dev/html/files/modules/Daemon-pm.html

docs/dev/html/files/modules/DuplicateKeyFinder-pm.html

docs/dev/html/files/modules/EventAggregator-pm.html

docs/dev/html/files/modules/EventTimeline-pm.html

docs/dev/html/files/modules/ExecutionThrottler-pm.html

docs/dev/html/files/modules/ExplainAnalyzer-pm.html

docs/dev/html/files/modules/FileIterator-pm.html

docs/dev/html/files/modules/ForeignKeyIterator-pm.html

docs/dev/html/files/modules/GeneralLogParser-pm.html

docs/dev/html/files/modules/HTTPProtocolParser-pm.html

docs/dev/html/files/modules/IndexUsage-pm.html

docs/dev/html/files/modules/InnoDBStatusParser-pm.html

docs/dev/html/files/modules/KeySize-pm.html

docs/dev/html/files/modules/LogSplitter-pm.html

docs/dev/html/files/modules/MaatkitTest-pm.html

docs/dev/html/files/modules/MasterSlave-pm.html

docs/dev/html/files/modules/MemcachedEvent-pm.html

docs/dev/html/files/modules/MemcachedProtocolParser-pm.html

docs/dev/html/files/modules/MockSth-pm.html

docs/dev/html/files/modules/MockSync-pm.html

docs/dev/html/files/modules/MockSyncStream-pm.html

docs/dev/html/files/modules/MySQLConfig-pm.html

docs/dev/html/files/modules/MySQLConfigComparer-pm.html

docs/dev/html/files/modules/MySQLDump-pm.html

docs/dev/html/files/modules/MySQLProtocolParser-pm.html

docs/dev/html/files/modules/OSCCaptureSync-pm.html

docs/dev/html/files/modules/OptionParser-pm.html

docs/dev/html/files/modules/Outfile-pm.html

docs/dev/html/files/modules/PgLogParser-pm.html

docs/dev/html/files/modules/Pipeline-pm.html

docs/dev/html/files/modules/PodParser-pm.html

docs/dev/html/files/modules/Processlist-pm.html

docs/dev/html/files/modules/ProcesslistAggregator-pm.html

docs/dev/html/files/modules/Progress-pm.html

docs/dev/html/files/modules/ProtocolParser-pm.html

docs/dev/html/files/modules/QueryAdvisorRules-pm.html

docs/dev/html/files/modules/QueryParser-pm.html

docs/dev/html/files/modules/QueryReportFormatter-pm.html

docs/dev/html/files/modules/QueryReview-pm.html

docs/dev/html/files/modules/QueryRewriter-pm.html

docs/dev/html/files/modules/Quoter-pm.html

docs/dev/html/files/modules/ReportFormatter-pm.html

docs/dev/html/files/modules/Retry-pm.html

docs/dev/html/files/modules/RowDiff-pm.html

docs/dev/html/files/modules/Runtime-pm.html

docs/dev/html/files/modules/SQLParser-pm.html

docs/dev/html/files/modules/Sandbox-pm.html

docs/dev/html/files/modules/Schema-pm.html

docs/dev/html/files/modules/SchemaIterator-pm.html

docs/dev/html/files/modules/SimpleTCPDumpParser-pm.html

docs/dev/html/files/modules/SlowLogParser-pm.html

docs/dev/html/files/modules/SlowLogWriter-pm.html

docs/dev/html/files/modules/SysLogParser-pm.html

docs/dev/html/files/modules/TCPRequestAggregator-pm.html

docs/dev/html/files/modules/TableChecksum-pm.html

docs/dev/html/files/modules/TableChunker-pm.html

docs/dev/html/files/modules/TableNibbler-pm.html

docs/dev/html/files/modules/TableParser-pm.html

docs/dev/html/files/modules/TableSyncChunk-pm.html

docs/dev/html/files/modules/TableSyncGroupBy-pm.html

docs/dev/html/files/modules/TableSyncNibble-pm.html

docs/dev/html/files/modules/TableSyncStream-pm.html

docs/dev/html/files/modules/TableSyncer-pm.html

docs/dev/html/files/modules/TableUsage-pm.html

docs/dev/html/files/modules/TcpdumpParser-pm.html

docs/dev/html/files/modules/TextResultSetParser-pm.html

docs/dev/html/files/modules/TimeSeriesTrender-pm.html

docs/dev/html/files/modules/Transformers-pm.html

docs/dev/html/files/modules/UpgradeReportFormatter-pm.html

docs/dev/html/files/modules/VariableAdvisorRules-pm.html

docs/dev/html/files/modules/VersionParser-pm.html

docs/dev/html/files/tools

docs/dev/html/files/tools/pt-archiver-pm.html

docs/dev/html/files/tools/pt-config-diff-pm.html

docs/dev/html/files/tools/pt-deadlock-logger-pm.html

docs/dev/html/files/tools/pt-duplicate-key-checker-pm.html

docs/dev/html/files/tools/pt-fifo-split-pm.html

docs/dev/html/files/tools/pt-find-pm.html

docs/dev/html/files/tools/pt-fk-error-logger-pm.html

docs/dev/html/files/tools/pt-heartbeat-pm.html

docs/dev/html/files/tools/pt-index-usage-pm.html

docs/dev/html/files/tools/pt-kill-pm.html

docs/dev/html/files/tools/pt-log-player-pm.html

docs/dev/html/files/tools/pt-online-schema-change-pm.html

docs/dev/html/files/tools/pt-profile-compact-pm.html

docs/dev/html/files/tools/pt-query-advisor-pm.html

docs/dev/html/files/tools/pt-query-digest-pm.html

docs/dev/html/files/tools/pt-query-profiler-pm.html

docs/dev/html/files/tools/pt-schema-advisor-pm.html

docs/dev/html/files/tools/pt-show-grants-pm.html

docs/dev/html/files/tools/pt-slave-delay-pm.html

docs/dev/html/files/tools/pt-slave-find-pm.html

docs/dev/html/files/tools/pt-slave-restart-pm.html

docs/dev/html/files/tools/pt-table-checksum-pm.html

docs/dev/html/files/tools/pt-table-sync-pm.html

docs/dev/html/files/tools/pt-table-usage-pm.html

docs/dev/html/files/tools/pt-tcp-model-pm.html

docs/dev/html/files/tools/pt-trend-pm.html

docs/dev/html/files/tools/pt-upgrade-pm.html

docs/dev/html/files/tools/pt-variable-advisor-pm.html

docs/dev/html/files/tools/pt-visual-explain-pm.html

docs/dev/html/index

docs/dev/html/index.html

docs/dev/html/index/Classes.html

docs/dev/html/index/Functions.html

docs/dev/html/index/Functions10.html

docs/dev/html/index/Functions11.html

docs/dev/html/index/Functions2.html

docs/dev/html/index/Functions3.html

docs/dev/html/index/Functions4.html

docs/dev/html/index/Functions5.html

docs/dev/html/index/Functions6.html

docs/dev/html/index/Functions7.html

docs/dev/html/index/Functions8.html

docs/dev/html/index/Functions9.html

docs/dev/html/index/General.html

docs/dev/html/index/General10.html

docs/dev/html/index/General11.html

docs/dev/html/index/General12.html

docs/dev/html/index/General13.html

docs/dev/html/index/General14.html

docs/dev/html/index/General15.html

docs/dev/html/index/General2.html

docs/dev/html/index/General3.html

docs/dev/html/index/General4.html

docs/dev/html/index/General5.html

docs/dev/html/index/General6.html

docs/dev/html/index/General7.html

docs/dev/html/index/General8.html

docs/dev/html/index/General9.html

docs/dev/html/index/Variables.html

docs/dev/html/index/Variables2.html

docs/dev/html/javascript

docs/dev/html/javascript/main.js

docs/dev/html/javascript/prettify.js

docs/dev/html/javascript/searchdata.js

docs/dev/html/search

docs/dev/html/search/ClassesA.html

docs/dev/html/search/ClassesB.html

docs/dev/html/search/ClassesC.html

docs/dev/html/search/ClassesD.html

docs/dev/html/search/ClassesE.html

docs/dev/html/search/ClassesF.html

docs/dev/html/search/ClassesG.html

docs/dev/html/search/ClassesH.html

docs/dev/html/search/ClassesI.html

docs/dev/html/search/ClassesK.html

docs/dev/html/search/ClassesL.html

docs/dev/html/search/ClassesM.html

docs/dev/html/search/ClassesO.html

docs/dev/html/search/ClassesP.html

docs/dev/html/search/ClassesQ.html

docs/dev/html/search/ClassesR.html

docs/dev/html/search/ClassesS.html

docs/dev/html/search/ClassesT.html

docs/dev/html/search/ClassesU.html

docs/dev/html/search/ClassesV.html

docs/dev/html/search/FunctionsA.html

docs/dev/html/search/FunctionsB.html

docs/dev/html/search/FunctionsC.html

docs/dev/html/search/FunctionsD.html

docs/dev/html/search/FunctionsE.html

docs/dev/html/search/FunctionsF.html

docs/dev/html/search/FunctionsG.html

docs/dev/html/search/FunctionsH.html

docs/dev/html/search/FunctionsI.html

docs/dev/html/search/FunctionsJ.html

docs/dev/html/search/FunctionsK.html

docs/dev/html/search/FunctionsL.html

docs/dev/html/search/FunctionsM.html

docs/dev/html/search/FunctionsN.html

docs/dev/html/search/FunctionsO.html

docs/dev/html/search/FunctionsP.html

docs/dev/html/search/FunctionsQ.html

docs/dev/html/search/FunctionsR.html

docs/dev/html/search/FunctionsS.html

docs/dev/html/search/FunctionsSymbols.html

docs/dev/html/search/FunctionsT.html

docs/dev/html/search/FunctionsU.html

docs/dev/html/search/FunctionsV.html

docs/dev/html/search/FunctionsW.html

docs/dev/html/search/GeneralA.html

docs/dev/html/search/GeneralB.html

docs/dev/html/search/GeneralC.html

docs/dev/html/search/GeneralD.html

docs/dev/html/search/GeneralE.html

docs/dev/html/search/GeneralF.html

docs/dev/html/search/GeneralG.html

docs/dev/html/search/GeneralH.html

docs/dev/html/search/GeneralI.html

docs/dev/html/search/GeneralJ.html

docs/dev/html/search/GeneralK.html

docs/dev/html/search/GeneralL.html

docs/dev/html/search/GeneralM.html

docs/dev/html/search/GeneralN.html

docs/dev/html/search/GeneralO.html

docs/dev/html/search/GeneralP.html

docs/dev/html/search/GeneralQ.html

docs/dev/html/search/GeneralR.html

docs/dev/html/search/GeneralS.html

docs/dev/html/search/GeneralSymbols.html

docs/dev/html/search/GeneralT.html

docs/dev/html/search/GeneralU.html

docs/dev/html/search/GeneralV.html

docs/dev/html/search/GeneralW.html

docs/dev/html/search/NoResults.html

docs/dev/html/search/VariablesA.html

docs/dev/html/search/VariablesB.html

docs/dev/html/search/VariablesC.html

docs/dev/html/search/VariablesD.html

docs/dev/html/search/VariablesE.html

docs/dev/html/search/VariablesF.html

docs/dev/html/search/VariablesG.html

docs/dev/html/search/VariablesH.html

docs/dev/html/search/VariablesI.html

docs/dev/html/search/VariablesL.html

docs/dev/html/search/VariablesM.html

docs/dev/html/search/VariablesN.html

docs/dev/html/search/VariablesO.html

docs/dev/html/search/VariablesP.html

docs/dev/html/search/VariablesQ.html

docs/dev/html/search/VariablesR.html

docs/dev/html/search/VariablesS.html

docs/dev/html/search/VariablesT.html

docs/dev/html/search/VariablesU.html

docs/dev/html/search/VariablesV.html

docs/dev/html/search/VariablesW.html

docs/dev/html/styles

docs/dev/html/styles/main.css

files modified:
.bzrignore

bin/pt-log-player

util/write-user-docs


docs/user/pt-visual-explain.rst

#################
pt-visual-explain
#################

.. highlight:: perl

****
NAME
****

pt-visual-explain - Format EXPLAIN output as a tree.

********
SYNOPSIS
********

Usage: pt-visual-explain [OPTION...] [FILE...]

pt-visual-explain transforms EXPLAIN output into a tree representation of
the query plan. If FILE is given, input is read from the file(s). With no
FILE, or when FILE is -, read standard input.

Examples:

.. code-block:: perl

   pt-visual-explain <file_containing_explain_output>

   pt-visual-explain -c <file_containing_query>

   mysql -e "explain select * from mysql.user" | pt-visual-explain

*****
RISKS
*****

The following section is included to inform users about the potential risks,
whether known or unknown, of using this tool. The two main categories of risks
are those created by the nature of the tool (e.g. read-only tools vs. read-write
tools) and those created by bugs.

pt-visual-explain is read-only and very low-risk.

At the time of this release, we know of no bugs that could cause serious harm to
users.

The authoritative source for updated information is always the online issue
tracking system. Issues that affect this tool will be marked as such. You can
see a list of such issues at the following URL:
`http://www.percona.com/bugs/pt-visual-explain <http://www.percona.com/bugs/pt-visual-explain>`_.

See also "BUGS" for more information on filing bugs and getting help.

***********
DESCRIPTION
***********

pt-visual-explain reverse-engineers MySQL's EXPLAIN output into a query
execution plan, which it then formats as a left-deep tree -- the same way the
plan is represented inside MySQL. It is possible to do this by hand, or to read
EXPLAIN's output directly, but it requires patience and expertise. Many people
find a tree representation more understandable.

You can pipe input into pt-visual-explain or specify a filename at the
command line, including the magical '-' filename, which will read from standard
input. It can do two things with the input: parse it for something that looks
like EXPLAIN output, or connect to a MySQL instance and run EXPLAIN on the
input.

When parsing its input, pt-visual-explain understands three formats: tabular
like that shown in the mysql command-line client, vertical like that created by
using the \G line terminator in the mysql command-line client, and tab
separated. It ignores any lines it doesn't know how to parse.
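
As an illustration of the vertical format only, here is a minimal Python sketch of a parser that keeps what looks like EXPLAIN fields and skips every other line. It is not the tool's Perl parser; the function name ``parse_vertical`` and both regular expressions are assumptions made for this example.

```python
import re

# Assumed layout of the mysql client's \G output: a starred "N. row" marker,
# then one "name: value" pair per line. Both patterns are illustrative.
ROW_MARKER = re.compile(r"^\*+\s*\d+\.\s*row\s*\*+$")
FIELD = re.compile(r"^\s*(\w+):\s?(.*)$")


def parse_vertical(text):
    """Return one dict per EXPLAIN row found in vertical-format text."""
    rows = []
    for line in text.splitlines():
        if ROW_MARKER.match(line.strip()):
            rows.append({})          # a new row starts here
            continue
        match = FIELD.match(line)
        if match and rows:           # anything else is silently ignored
            name, value = match.group(1), match.group(2).strip()
            rows[-1][name] = None if value == "NULL" else value
    return rows


sample = """\
mysql> explain select * from sakila.film\\G
*************************** 1. row ***************************
           id: 1
  select_type: SIMPLE
        table: film
         rows: 952
1 row in set (0.00 sec)
"""
rows = parse_vertical(sample)
```

The prompt line and the trailing "1 row in set" line match neither pattern, so they are dropped, which mirrors the "ignore what cannot be parsed" behavior described above.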

When executing the input, pt-visual-explain replaces everything in the input
up to and including the first SELECT keyword with 'EXPLAIN SELECT' and then
executes the result. You must specify "--connect" to execute the input as a
query.
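
The rewrite described above can be sketched in Python with a single regular-expression substitution. This is a hedged illustration, not the tool's code: the function name ``to_explain`` is hypothetical, and raising an error when no SELECT is found is an assumption of this sketch rather than documented behavior.

```python
import re


def to_explain(query):
    """Replace everything up to the first SELECT keyword with EXPLAIN SELECT."""
    rewritten, count = re.subn(
        r"\A.*?\bselect\b",          # shortest prefix ending in the word SELECT
        "EXPLAIN SELECT",
        query,
        count=1,
        flags=re.IGNORECASE | re.DOTALL,
    )
    if count == 0:
        # Assumption for this sketch: input without SELECT is rejected.
        raise ValueError("no SELECT keyword found in input")
    return rewritten


example = to_explain("/* report */ select * from sakila.film")
```

Because the prefix is discarded, input that already starts with ``explain select`` ends up with a single EXPLAIN rather than two.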

Either way, it builds a tree from the result set and prints it to standard
output. For the following query,

.. code-block:: perl

   select * from sakila.film_actor join sakila.film using(film_id);

pt-visual-explain generates this query plan:

.. code-block:: perl

   JOIN
   +- Bookmark lookup
   |  +- Table
   |  |  table          film_actor
   |  |  possible_keys  idx_fk_film_id
   |  +- Index lookup
   |     key            film_actor->idx_fk_film_id
   |     possible_keys  idx_fk_film_id
   |     key_len        2
   |     ref            sakila.film.film_id
   |     rows           2
   +- Table scan
      rows           952
      +- Table
         table          film
         possible_keys  PRIMARY

The query plan is left-deep, depth-first search, and the tree's root is the
output node -- the last step in the execution plan. In other words, read it
like this:

1. Table scan the 'film' table, which accesses an estimated 952 rows.

2. For each row, find matching rows by doing an index lookup into the
   film_actor->idx_fk_film_id index with the value from sakila.film.film_id,
   then a bookmark lookup into the film_actor table.

For more information on how to read EXPLAIN output, please see
`http://dev.mysql.com/doc/en/explain.html <http://dev.mysql.com/doc/en/explain.html>`_, and this talk titled "Query
Optimizer Internals and What's New in the MySQL 5.2 Optimizer," from Timour
Katchaounov, one of the MySQL developers:
`http://maatkit.org/presentations/katchaounov_timour.pdf <http://maatkit.org/presentations/katchaounov_timour.pdf>`_.

*******
MODULES
*******

This program is actually a runnable module, not just an ordinary Perl script.
In fact, there are two modules embedded in it. This makes unit testing easy,
but it also makes it easy for you to use the parsing and tree-building
functionality if you want.

The ExplainParser package accepts a string and parses whatever it thinks looks
like EXPLAIN output from it. The synopsis is as follows:

.. code-block:: perl

   require "pt-visual-explain";
   my $p = ExplainParser->new();
   my $rows = $p->parse("some text");
   # $rows is an arrayref of hashrefs.

The ExplainTree package accepts a set of rows and turns it into a tree. For
convenience, you can also have it delegate to ExplainParser and parse text for
you. Here's the synopsis:

.. code-block:: perl

   require "pt-visual-explain";
   my $e = ExplainTree->new();
   my $tree = $e->parse("some text", \%options);
   my $output = $e->pretty_print($tree);
   # Print the formatted text, not the tree data structure.
   print $output;

*********
ALGORITHM
*********

This section explains the algorithm that converts EXPLAIN into a tree. You may
be interested in reading this if you want to understand EXPLAIN more fully, or
if you are trying to figure out how this works, but otherwise this section will
probably not make your life richer.

The tree can be built by examining the id, select_type, and table columns of
each row. Here's what I know about them:

The id column is the sequential number of the select. This does not indicate
nesting; it just comes from counting SELECT from the left of the SQL statement.
It's like capturing parentheses in a regular expression. A UNION RESULT row
doesn't have an id, because it isn't a SELECT. The source code actually refers
to UNIONs as a fake_lex, as I recall.

If two adjacent rows have the same id value, they are joined with the standard
single-sweep multi-join method.

The select_type column tells a) that a new sub-scope has opened, b) what kind
of relationship the row has to the previous row, and c) what kind of operation
the row represents.

* SIMPLE means there are no subqueries or unions in the whole query.

* PRIMARY means there are, but this is the outermost SELECT.

* [DEPENDENT] UNION means this result is UNIONed with the previous result (not
  row; a result might encompass more than one row).

* UNION RESULT terminates a set of UNIONed results.

* [DEPENDENT|UNCACHEABLE] SUBQUERY means a new sub-scope is opening. This is the
  kind of subquery that happens in a WHERE clause, SELECT list or whatnot; it
  does not return a so-called "derived table."

* DERIVED is a subquery in the FROM clause.

Tables that are JOINed all have the same select_type. For example, if you JOIN
three tables inside a dependent subquery, they'll all say the same thing:
DEPENDENT SUBQUERY.

The table column usually specifies the table name or alias, but may also say
<derivedN> or <unionN,N...N>. If it says <derivedN>, the row represents an
access to the temporary table that holds the result of the subquery whose id is
N. If it says <unionN,..N> it's the same thing, but it refers to the results it
UNIONs together.

Finally, order matters. If a row's id is less than the one before it, I think
that means it is dependent on something other than the one before it. For
example,

.. code-block:: perl

   explain select
      (select 1 from sakila.film),
      (select 2 from sakila.film_actor),
      (select 3 from sakila.actor);

   | id | select_type | table      |
   +----+-------------+------------+
   |  1 | PRIMARY     | NULL       |
   |  4 | SUBQUERY    | actor      |
   |  3 | SUBQUERY    | film_actor |
   |  2 | SUBQUERY    | film       |

If the results were in order 2-3-4, I think that would mean 3 is a subquery of
2, and 4 is a subquery of 3. As it is, this means 4 is a subquery of the nearest
previous row with a smaller id, which is 1. Likewise for 3 and 2.
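
The "nearest previous row with a smaller id" rule can be sketched as a backward scan. This is only an illustration in Python; the helper name ``parent_index`` and the use of a plain list of ids (with ``None`` standing in for a NULL id) are assumptions of the example.

```python
def parent_index(ids, i):
    """Index of the nearest earlier row whose id is smaller than ids[i]."""
    for j in range(i - 1, -1, -1):
        # Rows without an id (NULL, shown here as None) cannot be parents.
        if ids[j] is not None and ids[j] < ids[i]:
            return j
    return None                      # the first row has no parent


actual_order = [1, 4, 3, 2]          # ids in the order EXPLAIN printed them
parents_actual = [parent_index(actual_order, i) for i in range(1, 4)]

other_order = [1, 2, 3, 4]           # the hypothetical 2-3-4 ordering
parents_other = [parent_index(other_order, i) for i in range(1, 4)]
```

With the order EXPLAIN really printed, every subquery resolves to row 0 (id 1); with the hypothetical 2-3-4 order, each row would resolve to the row just before it.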

This structure is hard to programmatically build into a tree for the same
reason it's hard to understand by inspection: there are both forward and
backward references. <derivedN> is a forward reference to selectN, while
<unionM,N> is a backward reference to selectM and selectN. That makes recursion
and other tree-building algorithms hard to get right (NOTE: after
implementation, I now see how it would be possible to deal with both forward
and backward references, but I have no motivation to change something that
works). Consider the following:

.. code-block:: perl

   select * from (
      select 1 from sakila.actor as actor_1
      union
      select 1 from sakila.actor as actor_2
   ) as der_1
   union
   select * from (
      select 1 from sakila.actor as actor_3
      union all
      select 1 from sakila.actor as actor_4
   ) as der_2;

   | id   | select_type  | table      |
   +------+--------------+------------+
   | 1    | PRIMARY      | <derived2> |
   | 2    | DERIVED      | actor_1    |
   | 3    | UNION        | actor_2    |
   | NULL | UNION RESULT | <union2,3> |
   | 4    | UNION        | <derived5> |
   | 5    | DERIVED      | actor_3    |
   | 6    | UNION        | actor_4    |
   | NULL | UNION RESULT | <union5,6> |
   | NULL | UNION RESULT | <union1,4> |

This would be a lot easier to work with if it looked like this (I've
bracketed the id on rows I moved):

.. code-block:: perl

   | id   | select_type  | table      |
   +------+--------------+------------+
   | [1]  | UNION RESULT | <union1,4> |
   | 1    | PRIMARY      | <derived2> |
   | [2]  | UNION RESULT | <union2,3> |
   | 2    | DERIVED      | actor_1    |
   | 3    | UNION        | actor_2    |
   | 4    | UNION        | <derived5> |
   | [5]  | UNION RESULT | <union5,6> |
   | 5    | DERIVED      | actor_3    |
   | 6    | UNION        | actor_4    |

In fact, why not re-number all the ids, so the PRIMARY row becomes 2, and so on?
That would make it even easier to read. Unfortunately that would also have the
effect of destroying the meaning of the id column, which I think is important to
preserve in the final tree. Also, though it makes it easier to read, it doesn't
make it easier to manipulate programmatically; so it's fine to leave them
numbered as they are.

The goal of re-ordering is to make it easier to figure out which rows are
children of which rows in the execution plan. Given the reordered list and some
row whose table is <union...> or <derived>, it is easy to find the beginning of
the slice of rows that should be child nodes in the tree: you just look for the
first row whose ID is the same as the first number in the table.
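
The re-ordering can be sketched as follows: each UNION RESULT row is moved to just before the first row whose id equals the first number in its <union...> name, and takes that id. This Python sketch is an illustration under stated assumptions, not the tool's implementation; the tuple layout ``(id, select_type, table)`` and the function names are invented for the example.

```python
import re


def first_union_id(table):
    """First number in a <unionN,...> table name, or None."""
    match = re.match(r"<union(\d+)", table)
    return int(match.group(1)) if match else None


def reorder(rows):
    """Move each UNION RESULT row in front of the first select it unions."""
    result = [row for row in rows if row[1] != "UNION RESULT"]
    for row in rows:
        if row[1] != "UNION RESULT":
            continue
        start = first_union_id(row[2])
        # Position of the first row that already carries that id.
        pos = next(i for i, r in enumerate(result) if r[0] == start)
        # The moved row borrows the id (the "bracketed" id in the text).
        result.insert(pos, (start, row[1], row[2]))
    return result


explain_rows = [
    (1, "PRIMARY", "<derived2>"),
    (2, "DERIVED", "actor_1"),
    (3, "UNION", "actor_2"),
    (None, "UNION RESULT", "<union2,3>"),
    (4, "UNION", "<derived5>"),
    (5, "DERIVED", "actor_3"),
    (6, "UNION", "actor_4"),
    (None, "UNION RESULT", "<union5,6>"),
    (None, "UNION RESULT", "<union1,4>"),
]
reordered = reorder(explain_rows)
```

Applied to the nine rows of the example, this yields the same sequence as the re-ordered listing, with the three UNION RESULT rows carrying ids 1, 2 and 5.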

The next question is how to find the last row that should be a child node of a
UNION or DERIVED. I'll start with DERIVED, because the solution makes UNION
easy.

Consider how MySQL numbers the SELECTs sequentially according to their position
in the SQL, left-to-right. Since a DERIVED table encloses everything within it
in a scope, which becomes a temporary table, there are only two things to think
about: its child subqueries and unions (if any), and its next siblings in the
scope that encloses it. Its children will all have an id greater than it does,
by definition, so any later rows with a smaller id terminate the scope.

Here's an example. The middle derived table here has a subquery and a UNION to
make it a little more complex for the example.

.. code-block:: perl

   explain select 1
   from (
      select film_id from sakila.film limit 1
   ) as der_1
   join (
      select film_id, actor_id, (select count(*) from sakila.rental) as r
      from sakila.film_actor limit 1
      union all
      select 1, 1, 1 from sakila.film_actor as dummy
   ) as der_2 using (film_id)
   join (
      select actor_id from sakila.actor limit 1
   ) as der_3 using (actor_id);

Here's the output of EXPLAIN:

.. code-block:: perl

   | id   | select_type  | table      |
   | 1    | PRIMARY      | <derived2> |
   | 1    | PRIMARY      | <derived6> |
   | 1    | PRIMARY      | <derived3> |
   | 6    | DERIVED      | actor      |
   | 3    | DERIVED      | film_actor |
   | 4    | SUBQUERY     | rental     |
   | 5    | UNION        | dummy      |
   | NULL | UNION RESULT | <union3,5> |
   | 2    | DERIVED      | film       |

The siblings all have id 1, and the middle one I care about is derived3.
(Notice MySQL doesn't execute them in the order I defined them, which is fine).
Now notice that MySQL prints out the rows in the opposite order I defined the
subqueries: 6, 3, 2. It always seems to do this, and there might be other
methods of finding the scope boundaries including looking for the lower boundary
of the next largest sibling, but this is a good enough heuristic. I am forced
to rely on it for non-DERIVED subqueries, so I rely on it here too. Therefore,
I decide that everything greater than or equal to 3 belongs to the DERIVED
scope.
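
A minimal Python sketch of that heuristic: start at the first row whose id is N and keep going while the ids stay greater than or equal to N. The function name ``derived_scope`` is hypothetical, and the id list assumes the UNION RESULT row has already been moved in front of row 3 and given id 3, as in the re-ordering described earlier.

```python
def derived_scope(ids, n):
    """Return (start, end) so that ids[start:end] is the scope of <derivedN>."""
    start = ids.index(n)             # first row whose id is N
    end = start
    # A later row with a smaller id terminates the scope.
    while end < len(ids) and ids[end] >= n:
        end += 1
    return start, end


# Re-ordered ids for the example: the UNION RESULT row now precedes row 3.
ids = [1, 1, 1, 6, 3, 3, 4, 5, 2]
scope_3 = derived_scope(ids, 3)
scope_6 = derived_scope(ids, 6)
scope_2 = derived_scope(ids, 2)
```

For derived3 the slice covers the UNION RESULT row plus rows 3, 4 and 5, and stops at the row with id 2; derived6 and derived2 each cover a single row.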

The rule for UNION is simple: they consume the entire enclosing scope, and to
find the component parts of each one, you find each part's beginning as referred
to in the <unionN,...> definition, and its end is either just before the next
one, or if it's the last part, the end is the end of the scope.

This is only simple because UNION consumes the entire scope, which is either the
entire statement, or the scope of a DERIVED table. This is because a UNION
cannot be a sibling of another UNION or a table, DERIVED or not. (Try writing
such a statement if you don't see it intuitively). Therefore, you can just find
the enclosing scope's boundaries, and the rest is easy. Notice in the example
above, the UNION is over <union3,5>, which includes the row with id 4 -- it
includes every row between 3 and 5.
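
Splitting a scope into its UNION parts can be sketched like this in Python. The helper ``union_parts`` is hypothetical, and it assumes the scope passed in is a list of ids that excludes the UNION RESULT row itself.

```python
import re


def union_parts(scope_ids, table):
    """Split a scope into the parts named by a <unionN,...> table value."""
    starts = [int(n) for n in re.findall(r"\d+", table)]
    positions = [scope_ids.index(n) for n in starts]
    # Each part ends just before the next one; the last ends with the scope.
    bounds = positions + [len(scope_ids)]
    return [scope_ids[bounds[i]:bounds[i + 1]] for i in range(len(positions))]


parts = union_parts([3, 4, 5], "<union3,5>")
```

For <union3,5> the first part holds rows 3 and 4 (the subquery on rental belongs to the first SELECT) and the second part holds row 5.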

Finally, there are non-derived subqueries to deal with as well. In this case I
can't look at siblings to find the end of the scope as I did for DERIVED. I
have to trust that MySQL executes depth-first. Here's an example:

.. code-block:: perl

   explain
   select actor_id,
   (
      select count(film_id)
      + (select count(*) from sakila.film)
      from sakila.film join sakila.film_actor using(film_id)
      where exists(
         select * from sakila.actor
         where sakila.actor.actor_id = sakila.film_actor.actor_id
      )
   )
   from sakila.actor;

   | id | select_type        | table      |
   |  1 | PRIMARY            | actor      |
   |  2 | SUBQUERY           | film       |
   |  2 | SUBQUERY           | film_actor |
   |  4 | DEPENDENT SUBQUERY | actor      |
   |  3 | SUBQUERY           | film       |

In order, the tree should be built like this:

* See row 1.

* See row 2. It's a higher id than 1, so it's a subquery, along with every other
  row whose id is greater than 2.

* Inside this scope, see 2 and 2 and JOIN them. See 4. It's a higher id than 2,
  so it's again a subquery; recurse. After that, see 3, which is also higher;
  recurse.

But the only reason the nested subquery didn't include select 3 is because
select 4 came first. In other words, if EXPLAIN looked like this,

.. code-block:: perl

   | id | select_type        | table      |
   |  1 | PRIMARY            | actor      |
   |  2 | SUBQUERY           | film       |
   |  2 | SUBQUERY           | film_actor |
   |  3 | SUBQUERY           | film       |
   |  4 | DEPENDENT SUBQUERY | actor      |

I would be forced to assume upon seeing select 3 that select 4 is a subquery
of it, rather than just being the next sibling in the enclosing scope. If this
is ever wrong, then the algorithm is wrong, and I don't see what could be done
about it.
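
The depth-first assumption can be sketched as a small recursive function over the id column alone. This is an illustration in Python, not the tool's code; ``nest`` and the ``(id, joined_rows, children)`` tuple shape are invented for the example.

```python
def nest(ids):
    """Nest EXPLAIN ids into (id, joined_rows, children) tuples."""

    def build(pos):
        scope_id = ids[pos]
        joined, children = 0, []
        # The scope lasts while ids stay at or above the id that opened it.
        while pos < len(ids) and ids[pos] >= scope_id:
            if ids[pos] == scope_id:
                joined += 1          # same id: another table in the join
                pos += 1
            else:
                # Higher id: a subquery opens here; recurse into it.
                child, pos = build(pos)
                children.append(child)
        return (scope_id, joined, children), pos

    tree, _ = build(0)
    return tree


as_printed = nest([1, 2, 2, 4, 3])
hypothetical = nest([1, 2, 2, 3, 4])
```

With the ids as EXPLAIN printed them, selects 4 and 3 become siblings inside select 2. With the hypothetical order, select 4 is swallowed by select 3, which is exactly the ambiguity the paragraph above describes.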

UNION is a little more complicated than just "the entire scope is a UNION,"
because the UNION might itself be inside an enclosing scope that's only
indicated by the first item inside the UNION. There are only three kinds of
enclosing scopes: UNION, DERIVED, and SUBQUERY. A UNION can't enclose a UNION,
and a DERIVED has its own "scope markers," but a SUBQUERY can wholly enclose a
UNION, like this strange example on the empty table t1:

.. code-block:: perl

   explain select * from t1 where not exists(
      (select t11.i from t1 t11) union (select t12.i from t1 t12));

   +------+--------------+------------+--------------------------------+
   |    1 | PRIMARY      | t1         | const row not found            |
   |    4 | UNION        | t12        | const row not found            |

The UNION's backward references might make it look like the UNION encloses the
subquery, but studying the query makes it clear this isn't the case. So when a
UNION's first row says SUBQUERY, it is this special case.

By the way, I don't fully understand this query plan; there are 4 numbered
SELECT in the plan, but only 3 in the query. The parens around the UNIONs are
meaningful. Removing them will make the EXPLAIN different. Please tell me how
and why this works if you know.

Armed with this knowledge, it's possible to use recursion to turn the
parent-child relationship between all the rows into a tree representing the
execution plan.

MySQL prints the rows in execution order, even the forward and backward
references. At any given scope, the rows are processed as a left-deep tree.
MySQL does not do "bushy" execution plans. It begins with a table, finds a
matching row in the next table, and continues till the last table, when it emits
a row. When it runs out, it backtracks till it can find the next row and
repeats. There are subtleties of course, but this is the basic plan. This is
why MySQL transforms all RIGHT OUTER JOINs into LEFT OUTER JOINs and cannot do
FULL OUTER JOIN.

This means in any given scope, say

.. code-block:: perl

   | id   | select_type  | table      |
   | 1    | SIMPLE       | tbl1       |
   | 1    | SIMPLE       | tbl2       |
   | 1    | SIMPLE       | tbl3       |

The execution plan looks like a depth-first traversal of this tree:

.. code-block:: perl

          JOIN
         /    \
       JOIN  tbl3
      /    \
    tbl1   tbl2

The JOIN might not be a JOIN. It might be a subquery, for example. This comes
from the type column of EXPLAIN. The documentation says this is a "join type,"
but I think "access type" is more accurate, because it's "how MySQL accesses
rows."
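
The left-deep shape for a scope with several tables can be sketched as a fold over the table list. This Python fragment is illustrative only; the tuple representation ``("JOIN", left, right)`` and the helper names are assumptions of the example.

```python
from functools import reduce


def left_deep(tables):
    """Fold tables, in EXPLAIN order, into nested ("JOIN", left, right) tuples."""
    return reduce(lambda left, right: ("JOIN", left, right), tables)


def leaves(node):
    """Depth-first, left-to-right list of the tables in a join tree."""
    if isinstance(node, str):
        return [node]
    _, left, right = node
    return leaves(left) + leaves(right)


tree = left_deep(["tbl1", "tbl2", "tbl3"])
order = leaves(tree)
```

The right child of every JOIN is a single table, never another JOIN, which is what distinguishes a left-deep plan from a "bushy" one. A depth-first walk of the leaves gives back the order in which EXPLAIN listed the tables.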

pt-visual-explain decorates the tree significantly more than just turning
rows into nodes. Each node may get a series of transformations that turn it
into a subtree of more than one node. For example, an index scan not marked
with 'Using index' must do a bookmark lookup into the table rows; that is a
three-node subtree. However, once the node ordering and scoping described
above are done, the rest of the process is pretty simple.
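
The bookmark-lookup transformation can be pictured as a function that expands
one EXPLAIN row into either a single node or a three-node subtree. The sketch
below is an assumption-laden illustration: the node labels, the dict layout,
and the ``expand_access`` helper are invented here, and the ``clustered_pk``
parameter only mimics the idea behind the "--clustered-pk" option.

```python
def expand_access(row, clustered_pk=False):
    """Turn one index-scan EXPLAIN row (a dict) into a small subtree."""
    index_node = {"type": "Index scan", "key": row["key"], "children": []}

    # 'Using index' in Extra means the index alone satisfies the query.
    covering = "Using index" in (row.get("Extra") or "")
    # With a clustered primary key, the index entry already is the row.
    primary_is_row = clustered_pk and row["key"] == "PRIMARY"

    if covering or primary_is_row:
        return index_node

    # Otherwise the index entry only points at the row, so a lookup into
    # the table is needed: parent node plus two children.
    table_node = {"type": "Table", "table": row["table"], "children": []}
    return {"type": "Bookmark lookup", "children": [table_node, index_node]}


# Hypothetical rows, reduced to the columns this sketch reads.
plain = expand_access({"table": "t", "key": "idx_a", "Extra": ""})
covered = expand_access({"table": "t", "key": "idx_a", "Extra": "Using index"})
clustered = expand_access(
    {"table": "t", "key": "PRIMARY", "Extra": None}, clustered_pk=True
)
```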


*******
OPTIONS
*******


This tool accepts additional command-line arguments. Refer to the
"SYNOPSIS" and usage information for details.


--ask-pass

Prompt for a password when connecting to MySQL.


--charset

short form: -A; type: string

Default character set. If the value is utf8, sets Perl's binmode on
STDOUT to utf8, passes the mysql_enable_utf8 option to DBD::mysql, and
runs SET NAMES UTF8 after connecting to MySQL. Any other value sets
binmode on STDOUT without the utf8 layer, and runs SET NAMES after
connecting to MySQL.


--clustered-pk

Assume that PRIMARY KEY index accesses don't need to do a bookmark lookup to
retrieve rows. This is the case for InnoDB.


--config

type: Array

Read this comma-separated list of config files; if specified, this must be the
first option on the command line.


--connect

Treat input as a query, and obtain EXPLAIN output by connecting to a MySQL
instance and running EXPLAIN on the query. When this option is given,
pt-visual-explain uses the other connection-specific options such as
"--user" to connect to the MySQL instance. If you have a .my.cnf file,
it will read it, so you may not need to specify any connection-specific
options.


--database

short form: -D; type: string

Connect to this database.


--defaults-file

short form: -F; type: string

Only read mysql options from the given file. You must give an absolute
pathname.


--format

type: string; default: tree

Set output format.

The default is a terse pretty-printed tree. The valid values are:


.. code-block:: perl

   value  meaning
   =====  =======
   tree   Pretty-printed terse tree.
   dump   Data::Dumper output (see Data::Dumper for more).


--help

Show help and exit.


--host

short form: -h; type: string

Connect to host.


--password

short form: -p; type: string

Password to use when connecting.


--pid

type: string

Create the given PID file. The file contains the process ID of the script
and is removed when the script exits. Before starting, the script checks
whether the PID file already exists. If it does not, the script creates it
and writes its own PID to it. If it does, the outcome depends on the file's
contents: if the file contains a PID and a process is running with that PID,
the script dies; if no process is running with that PID, the script
overwrites the file with its own PID and starts; and if the file contains no
PID, the script dies.
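
The decision procedure for an existing PID file is easy to misread in prose, so
here is a minimal sketch of it. It is not the tool's code: the function names
``claim_pid_file`` and ``pid_is_running`` are invented for this example, the
liveness check assumes a POSIX system, and the check-then-write sequence is not
atomic.

```python
import os


def pid_is_running(pid):
    """Return True if a process with this PID exists (POSIX only)."""
    try:
        os.kill(pid, 0)  # signal 0 tests for existence, sends nothing
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # the process exists but belongs to another user
    return True


def claim_pid_file(path, is_running=pid_is_running):
    """Create or take over the PID file, or raise RuntimeError ("die")."""
    if os.path.exists(path):
        with open(path) as fh:
            content = fh.read().strip()
        if not content:
            # File exists but holds no PID: refuse to start.
            raise RuntimeError(f"PID file {path} exists but contains no PID")
        if is_running(int(content)):
            # Another live instance owns the file: refuse to start.
            raise RuntimeError(f"PID file {path}: PID {content} is running")
        # Otherwise the file is stale and is overwritten below.
    with open(path, "w") as fh:
        fh.write(f"{os.getpid()}\n")
```

The ``is_running`` parameter exists only so the three branches can be exercised
without depending on which processes happen to be alive.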


--port

short form: -P; type: int

Port number to use for connection.


--set-vars

type: string; default: wait_timeout=10000

Set these MySQL variables. Immediately after connecting to MySQL, this
string will be appended to SET and executed.


--socket

short form: -S; type: string

Socket file to use for connection.


--user

short form: -u; type: string

User for login if not current user.


--version

Show version and exit.



***********
DSN OPTIONS
***********


These DSN options are used to create a DSN. Each option is given like
``option=value``. The options are case-sensitive, so P and p are not the
same option. There cannot be whitespace before or after the ``=``, and
if the value contains whitespace it must be quoted. DSN options are
comma-separated. See the percona-toolkit manpage for full details.


\* A

dsn: charset; copy: yes

Default character set.


\* D

dsn: database; copy: yes

Default database.


\* F

dsn: mysql_read_default_file; copy: yes

Only read default options from the given file.


\* h

dsn: host; copy: yes

Connect to host.


\* p

dsn: password; copy: yes

Password to use when connecting.


\* P

dsn: port; copy: yes

Port number to use for connection.


\* S

dsn: mysql_socket; copy: yes

Socket file to use for connection.


\* u

dsn: user; copy: yes

User for login if not current user.
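
To make the DSN syntax concrete, the sketch below splits a DSN string into the
long names listed above. It is an illustrative parser, not the toolkit's own:
``parse_dsn`` and ``DSN_KEYS`` are names invented here, values containing commas
are not handled, and any shorthand forms the real tools may accept are ignored.

```python
# Short DSN keys mapped to the "dsn:" names from the list above.
DSN_KEYS = {
    "A": "charset",
    "D": "database",
    "F": "mysql_read_default_file",
    "h": "host",
    "p": "password",
    "P": "port",
    "S": "mysql_socket",
    "u": "user",
}


def parse_dsn(dsn):
    """Parse a comma-separated list of option=value pairs into a dict."""
    parsed = {}
    for part in dsn.split(","):
        key, sep, value = part.partition("=")
        if not sep:
            raise ValueError(f"DSN part {part!r} is not option=value")
        # Keys are case-sensitive: "p" (password) and "P" (port) differ.
        if key not in DSN_KEYS:
            raise ValueError(f"unknown DSN option {key!r}")
        parsed[DSN_KEYS[key]] = value
    return parsed


# Hypothetical DSN, for illustration only.
example = parse_dsn("h=db1,P=3306,u=app,p=secret")
```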



***********
DOWNLOADING
***********


Visit `http://www.percona.com/software/ <http://www.percona.com/software/>`_ to download the latest release of
Percona Toolkit. Or, to get the latest release from the command line:


.. code-block:: perl

   wget percona.com/latest/percona-toolkit/PKG


Replace ``PKG`` with ``tar``, ``rpm``, or ``deb`` to download the package in that
format. You can also get individual tools from the latest release:


.. code-block:: perl

   wget percona.com/latest/percona-toolkit/TOOL


Replace ``TOOL`` with the name of any tool.



***********
ENVIRONMENT
***********


The environment variable ``PTDEBUG`` enables verbose debugging output to STDERR.
To enable debugging and capture all output to a file, run the tool like:


.. code-block:: perl

   PTDEBUG=1 pt-visual-explain ... > FILE 2>&1


Be careful: debugging output is voluminous and can generate several megabytes
of output.



*******************
SYSTEM REQUIREMENTS
*******************


You need Perl, DBI, DBD::mysql, and some core packages that ought to be
installed in any reasonably new version of Perl.



****
BUGS
****


For a list of known bugs, see `http://www.percona.com/bugs/pt-visual-explain <http://www.percona.com/bugs/pt-visual-explain>`_.

Please report bugs at `https://bugs.launchpad.net/percona-toolkit <https://bugs.launchpad.net/percona-toolkit>`_.
Include the following information in your bug report:


\* Complete command-line used to run the tool



\* Tool "--version"



\* MySQL version of all servers involved



\* Output from the tool including STDERR



\* Input files (log/dump/config files, etc.)



If possible, include debugging output by running the tool with ``PTDEBUG``;
see "ENVIRONMENT".



*******
AUTHORS
*******


Baron Schwartz


*********************
ABOUT PERCONA TOOLKIT
*********************


This tool is part of Percona Toolkit, a collection of advanced command-line
tools developed by Percona for MySQL support and consulting. Percona Toolkit
was forked from two projects in June, 2011: Maatkit and Aspersa. Those
projects were created by Baron Schwartz and developed primarily by him and
Daniel Nichter, both of whom are employed by Percona. Visit
`http://www.percona.com/software/ <http://www.percona.com/software/>`_ for more software developed by Percona.



********************************
COPYRIGHT, LICENSE, AND WARRANTY
********************************


Feedback and improvements are welcome.

THIS PROGRAM IS PROVIDED "AS IS" AND WITHOUT ANY EXPRESS OR IMPLIED
WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF
MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE.

This program is free software; you can redistribute it and/or modify it under
the terms of the GNU General Public License as published by the Free Software
Foundation, version 2; OR the Perl Artistic License. On UNIX and similar
systems, you can issue \`man perlgpl' or \`man perlartistic' to read these
licenses.

You should have received a copy of the GNU General Public License along with
this program; if not, write to the Free Software Foundation, Inc., 59 Temple
Place, Suite 330, Boston, MA 02111-1307 USA.



*******
VERSION
*******


Percona Toolkit v1.0.0 released 2011-08-01