~ubuntu-branches/debian/sid/octave3.0/sid

— Loadable Function: [<var>s</var>, <var>e</var>, <var>te</var>, <var>m</var>, <var>t</var>, <var>nm</var>] = regexp (<var>str, pat</var>)<var><a name="index-regexp-283"></a></var>

220

— Loadable Function: [...] = regexp (<var>str, pat, opts, ...</var>)<var><a name="index-regexp-284"></a></var>

221

222

Regular expression string matching. Matches <var>pat</var> in <var>str</var> and

223

returns the position and matching substrings or empty values if there are

224

none.

225

226

The matched pattern <var>pat</var> can include any of the standard regex

227

operators, including:

228

229

<dl>

230

<dt><code>.</code><dd>Match any character

231

<dt><code>* + ? {}</code><dd>Repetition operators, representing

232

<dl>

233

<dt><code>*</code><dd>Match zero or more times

234

<dt><code>+</code><dd>Match one or more times

235

<dt><code>?</code><dd>Match zero or one times

236

<dt><code>{}</code><dd>Match range operator, which is of the form <code>{</code><var>n</var><code>}</code> to match exactly

237

<var>n</var> times, <code>{</code><var>m</var><code>,}</code> to match <var>m</var> or more times,

238

<code>{</code><var>m</var><code>,</code><var>n</var><code>}</code> to match between <var>m</var> and <var>n</var> times.

239

</dl>

240

<dt><code>[...] [^...]</code><dd>List operators, where for example <code>[ab]c</code> matches <code>ac</code> and <code>bc</code>

241

<dt><code>()</code><dd>Grouping operator

242

<dt><code>|</code><dd>Alternation operator. Match one of a choice of regular expressions. The

243

alternatives must be delimited by the grouping operator <code>()</code> above

244

<dt><code>^ $</code><dd>Anchoring operator. <code>^</code> matches the start of the string <var>str</var> and

245

<code>$</code> the end

246

</dl>

247

248

In addition the following escaped characters have special meaning. It should

249

be noted that it is recommended to quote <var>pat</var> in single quotes rather

250

than double quotes, to avoid the escape sequences being interpreted by octave

251

before being passed to <code>regexp</code>.

252

253

<dl>

254

<dt><code>\b</code><dd>Match a word boundary

255

<dt><code>\B</code><dd>Match within a word

256

<dt><code>\w</code><dd>Matches any word character

257

<dt><code>\W</code><dd>Matches any non word character

258

<dt><code>\<</code><dd>Matches the beginning of a word

259

<dt><code>\></code><dd>Matches the end of a word

260

<dt><code>\s</code><dd>Matches any whitespace character

261

<dt><code>\S</code><dd>Matches any non whitespace character

262

<dt><code>\d</code><dd>Matches any digit

263

<dt><code>\D</code><dd>Matches any non-digit

264

</dl>

265

266

The outputs of <code>regexp</code> by default are in the order as given below

267

268

<dl>

269

<dt><var>s</var><dd>The start indices of each of the matching substrings

270

271

<dt><var>e</var><dd>The end indices of each matching substring

272

273

<dt><var>te</var><dd>The extents of each of the matched token surrounded by <code>(...)</code> in

274

<var>pat</var>.

275

276

<dt><var>m</var><dd>A cell array of the text of each match.

277

278

<dt><var>t</var><dd>A cell array of the text of each token matched.

279

280

<dt><var>nm</var><dd>A structure containing the text of each matched named token, with the name

281

being used as the fieldname. A named token is denoted as

282

283

</dl>

284

285

Particular output arguments or the order of the output arguments can be

286

selected by additional <var>opts</var> arguments. These are strings and the

287

correspondence between the output arguments and the optional argument

288

are

289

290

<table summary=""><tr align="left"><td valign="top" width="20%"></td><td valign="top" width="30%">'start' </td><td valign="top" width="30%"><var>s</var> </td><td valign="top" width="20%">

291

</td></tr><tr align="left"><td valign="top" width="20%"></td><td valign="top" width="30%">'end' </td><td valign="top" width="30%"><var>e</var> </td><td valign="top" width="20%">

292

</td></tr><tr align="left"><td valign="top" width="20%"></td><td valign="top" width="30%">'tokenExtents' </td><td valign="top" width="30%"><var>te</var> </td><td valign="top" width="20%">

293

</td></tr><tr align="left"><td valign="top" width="20%"></td><td valign="top" width="30%">'match' </td><td valign="top" width="30%"><var>m</var> </td><td valign="top" width="20%">

294

</td></tr><tr align="left"><td valign="top" width="20%"></td><td valign="top" width="30%">'tokens' </td><td valign="top" width="30%"><var>t</var> </td><td valign="top" width="20%">

295

</td></tr><tr align="left"><td valign="top" width="20%"></td><td valign="top" width="30%">'names' </td><td valign="top" width="30%"><var>nm</var> </td><td valign="top" width="20%">

296

</td></tr></table>

297

298

A further optional argument is 'once', that limits the number of returned

299

matches to the first match. Additional arguments are

300

301

<dl>

302

<dt>matchcase<dd>Make the matching case sensitive.

303

<dt>ignorecase<dd>Make the matching case insensitive.

304

<dt>stringanchors<dd>Match the anchor characters at the beginning and end of the string.

305

<dt>lineanchors<dd>Match the anchor characters at the beginning and end of the line.

306

<dt>dotall<dd>The character <code>.</code> matches the newline character.

307

<dt>dotexceptnewline<dd>The character <code>.</code> matches all but the newline character.

308

<dt>freespacing<dd>The pattern can include arbitrary whitespace and comments starting with

309

<code>#</code>.

310

<dt>literalspacing<dd>The pattern is taken literally.

311

</dl>

312

</blockquote></div>

313

314

315

316

317

— Loadable Function: [<var>s</var>, <var>e</var>, <var>te</var>, <var>m</var>, <var>t</var>, <var>nm</var>] = regexpi (<var>str, pat</var>)<var><a name="index-regexpi-285"></a></var>

318

— Loadable Function: [...] = regexpi (<var>str, pat, opts, ...</var>)<var><a name="index-regexpi-286"></a></var>

319

320

Case insensitive regular expression string matching. Matches <var>pat</var> in

321

<var>str</var> and returns the position and matching substrings or empty values

322

if there are none. See <code>regexp</code> for more details

323

</blockquote></div>

324

325

326

327

328

— Loadable Function: <var>string</var> = regexprep (<var>string, pat, repstr, options</var>)<var><a name="index-regexprep-287"></a></var>

329

<blockquote>Replace matches of <var>pat</var> in <var>string</var> with <var>repstr</var>.

330

331

The replacement can contain <code>$i</code>, which substitutes

332

for the ith set of parentheses in the match string. E.g.,

333

334

regexprep("Bill Dunn",'(\w+) (\w+)','$2, $1')

335

336

</pre>

337

returns "Dunn, Bill"

338

339

<var>options</var> may be zero or more of

340

<dl>

341

<dt>‘<samp>once</samp>’<dd>Replace only the first occurrence of <var>pat</var> in the result.

342

343

<dt>‘<samp>warnings</samp>’<dd>This option is present for compatibility but is ignored.

344

345

<dt>‘<samp>ignorecase or matchcase</samp>’<dd>Ignore case for the pattern matching (see <code>regexpi</code>).

346

Alternatively, use (?i) or (?-i) in the pattern.

347

348

<dt>‘<samp>lineanchors and stringanchors</samp>’<dd>Whether characters ^ and $ match the beginning and ending of lines.

349

Alternatively, use (?m) or (?-m) in the pattern.

350

351

<dt>‘<samp>dotexceptnewline and dotall</samp>’<dd>Whether . matches newlines in the string.

352

Alternatively, use (?s) or (?-s) in the pattern.

353

354

<dt>‘<samp>freespacing or literalspacing</samp>’<dd>Whether whitespace and # comments can be used to make the regular expression more readable.

355

Alternatively, use (?x) or (?-x) in the pattern.

356

357

</dl>

358

359

360

361

</pre>

362

See also: regexp,regexpi.

363

</blockquote></div>

364

365

</body></html>

366

Older »