<c>Option = {header, HeaderLength} | {format, Format} | {order, Order} | {unique, bool()} | {tmpdir, TempDirectory} | {compressed, bool()} | {size, Size} | {no_files, NoFiles}</c>

</item>
<item>
<c>HeaderLength = int() > 0</c>
</item>
<item>
<c>Format = binary_term | term | binary | FormatFun</c>
</item>
<item>
<c>FormatFun = fun(Binary) -> Term</c>
</item>
<item>
<c>Order = ascending | descending | OrderFun</c>
</item>
<item>
<c>OrderFun = fun(Term, Term) -> bool()</c>
</item>
<item>
<c>TempDirectory = "" | file_name()</c>
</item>

<item>
<c>Size = int() >= 0</c>
</item>

<item>
<c>NoFiles = int() > 1</c>
</item>
</list>
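As an illustration of how several options combine, the following minimal sketch sorts a small file of period-terminated terms in descending order and drops duplicates. The module name <c>fs_options_sketch</c> and the file names are hypothetical, and the sketch assumes that the current directory is writable.

```erlang
-module(fs_options_sketch).
-export([demo/0]).

%% Hypothetical file names, used only for this illustration.
-define(IN, "fs_options_in.terms").
-define(OUT, "fs_options_out.terms").

demo() ->
    %% With the term format, files hold period-terminated Erlang terms.
    ok = file:write_file(?IN, "b.\na.\nc.\na.\n"),
    Options = [{format, term}, {order, descending}, {unique, true}],
    ok = file_sorter:sort([?IN], ?OUT, Options),
    %% Read the sorted terms back for inspection.
    {ok, Sorted} = file:consult(?OUT),
    Sorted.
```

Under these assumptions, <c>fs_options_sketch:demo()</c> is expected to return the three distinct atoms with <c>c</c> first and <c>a</c> last.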

As an alternative to sorting files, a function of one argument
can be given as input. When called with the argument <c>read</c>,
the function is assumed to return <c>end_of_input</c> or
<c>{end_of_input, Value}</c> when there is no more input
(<c>Value</c> is explained below), or <c>{Objects, Fun}</c>,
where <c>Objects</c> is a list of binaries or terms depending on
the format and <c>Fun</c> is a new input function. Any other
value is immediately returned as the value of the current call to
<c>sort</c> or <c>keysort</c>. Each input function is called
exactly once. If an error occurs, the last function is called
with the argument <c>close</c>, and its reply is ignored.
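The input protocol described above can be sketched with a pair of input functions that feed an in-memory list to <c>sort</c>. This is only an illustration: the module name <c>fs_input_sketch</c>, the helper <c>done/0</c> and the output file name are hypothetical, and the sketch assumes the <c>term</c> format and a writable current directory.

```erlang
-module(fs_input_sketch).
-export([sort_list/1]).

%% Hypothetical output file name, used only for this illustration.
-define(OUT, "fs_input_out.terms").

%% First input function: hands over all terms as one list of objects,
%% paired with the input function to be called next.
input(Terms) ->
    fun(close) -> ok;
       (read)  -> {Terms, done()}
    end.

%% Last input function: reports that there is no more input.
done() ->
    fun(close) -> ok;
       (read)  -> end_of_input
    end.

sort_list(Terms) ->
    ok = file_sorter:sort(input(Terms), ?OUT, [{format, term}]),
    %% Read the sorted terms back from the output file.
    {ok, Sorted} = file:consult(?OUT),
    Sorted.
```

Each function handles both <c>read</c> and <c>close</c>, since <c>close</c> may be sent to whichever input function is the last one when an error occurs.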

A function of one argument can be given as output. The results
of sorting or merging the input are collected in a non-empty
sequence of variable-length lists of binaries or terms, depending
on the format. The output function is called with one list at a
time, and is assumed to return a new output function. Any other
return value is immediately returned as the value of the current
call to the sort or merge function. Each output function is
called exactly once. When some output function has been applied
to all of the results, or an error occurs, the last function is
called with the argument <c>close</c>, and the reply is returned
as the value of the current call to the sort or merge function. If a
function is given as input and the last input function returns
<c>{end_of_input, Value}</c>, the function given as output is
called with the argument <c>{value, Value}</c>. This makes it
easy to initiate the sequence of output functions with a value
calculated by the input functions.
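A minimal sketch of the <c>{end_of_input, Value}</c> and <c>{value, Value}</c> hand-over follows. The input side counts the terms it delivers and the output side returns that count together with the sorted terms. The module name <c>fs_output_sketch</c> and the helpers <c>done/1</c> and <c>output/2</c> are hypothetical, and the sketch assumes the <c>term</c> format.

```erlang
-module(fs_output_sketch).
-export([sort_list/1]).

%% Input functions: deliver all terms in one list, then end the input
%% with a value (here the number of terms) for the output side.
input(Terms) ->
    fun(close) -> ok;
       (read)  -> {Terms, done(length(Terms))}
    end.

done(Count) ->
    fun(close) -> ok;
       (read)  -> {end_of_input, Count}
    end.

%% Output functions: remember the value passed on by the input side,
%% collect the result lists, and build the final reply on close.
output(Count, Acc) ->
    fun({value, Value}) ->
            output(Value, Acc);
       (close) ->
            {Count, lists:append(lists:reverse(Acc))};
       (Objects) when is_list(Objects) ->
            output(Count, [Objects | Acc])
    end.

sort_list(Terms) ->
    file_sorter:sort(input(Terms), output(undefined, []), [{format, term}]).
```

Because the reply to <c>close</c> becomes the value of the call to <c>sort</c>, <c>sort_list/1</c> returns a tuple of the count and the sorted list rather than <c>ok</c>.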

As an example, consider sorting the terms on a disk log file.
A function that reads chunks from the disk log and returns a
list of binaries is used as input. The results are collected in
a list of terms.

<pre>
sort(Log) ->
    {ok, _} = disk_log:open([{name,Log}, {mode,read_only}]),
    Input = input(Log, start),
    Output = output([]),
    Reply = file_sorter:sort(Input, Output, {format,term}),
    ok = disk_log:close(Log),
    Reply.

%% Input function: each read returns the next chunk of the log
%% together with a new input function holding the continuation.
input(Log, Cont) ->
    fun(close) ->
            ok;
       (read) ->
            case disk_log:chunk(Log, Cont) of
                {error, Reason} ->
                    {error, Reason};
                {Cont2, Terms} ->
                    {Terms, input(Log, Cont2)};
                {Cont2, Terms, _Badbytes} ->
                    {Terms, input(Log, Cont2)};
                eof ->
                    end_of_input
            end
    end.

%% Output function: accumulates the result lists in reverse order
%% and returns them as one list when called with close.
output(L) ->
    fun(close) ->
            lists:append(lists:reverse(L));
       (Terms) ->
            output([Terms | L])
    end. </pre>

Further examples of functions as input and output can be found
at the end of the <c>file_sorter</c> module; the <c>term</c>
format is implemented with functions.

The possible values of <c>Reason</c> returned when an error
occurs are:

<list type="bulleted">
<item>
<c>bad_object</c>, <c>{bad_object, FileName}</c>.
Applying the format function failed for some binary,
or the key(s) could not be extracted from some term.
</item>
<item>
<c>{bad_term, FileName}</c>. <c>io:read/2</c> failed
to read some term.
</item>
<item>
<c>{file_error, FileName, Reason2}</c>. See
<c>file(3)</c> for an explanation of <c>Reason2</c>.
</item>
<item>
<c>{premature_eof, FileName}</c>. End-of-file was
encountered inside some binary term.
</item>
</list>

Types
<pre>
Binary = binary()
FileName = file_name()
FileNames = [FileName]
ICommand = read | close
IReply = end_of_input | {end_of_input, Value} | {[Object], Infun} | InputReply
Infun = fun(ICommand) -> IReply
Input = FileNames | Infun
InputReply = Term
KeyPos = int() > 0 | [int() > 0]
OCommand = {value, Value} | [Object] | close
OReply = Outfun | OutputReply
Object = Term | Binary
Outfun = fun(OCommand) -> OReply
Output = FileName | Outfun
OutputReply = Term
Term = term()
Value = Term</pre>

</description>

<funcs>
<func>
<name>sort(FileName) -> Reply</name>
<name>sort(Input, Output) -> Reply</name>
<name>sort(Input, Output, Options) -> Reply</name>
<fsummary>Sort terms on files.</fsummary>
<type>
<v>Reply = ok | {error, Reason} | InputReply | OutputReply</v>
</type>
<desc>
Sorts terms on files.

<c>sort(FileName)</c> is equivalent to
<c>sort([FileName], FileName)</c>.

<c>sort(Input, Output)</c> is equivalent to
<c>sort(Input, Output, [])</c>.
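
As a minimal sketch of sorting a file onto itself, the three-argument form can be called with the same file as the only input and as the output, mirroring the <c>sort(FileName)</c> equivalence but with the <c>term</c> format selected. The module name <c>fs_sort_sketch</c> and the file name are hypothetical, and a writable current directory is assumed.

```erlang
-module(fs_sort_sketch).
-export([demo/0]).

%% Hypothetical file name, used only for this illustration.
-define(FILE_NAME, "fs_sort.terms").

demo() ->
    %% Three period-terminated terms, deliberately out of order.
    ok = file:write_file(?FILE_NAME, "{b,2}.\n{a,1}.\n{c,3}.\n"),
    %% The same file is both the only input and the output.
    ok = file_sorter:sort([?FILE_NAME], ?FILE_NAME, [{format, term}]),
    {ok, Sorted} = file:consult(?FILE_NAME),
    Sorted.
```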

</desc>
</func>
<func>
<name>keysort(KeyPos, FileName) -> Reply</name>
<name>keysort(KeyPos, Input, Output) -> Reply</name>
<name>keysort(KeyPos, Input, Output, Options) -> Reply</name>
<fsummary>Sort terms on files by key.</fsummary>
<type>
<v>Reply = ok | {error, Reason} | InputReply | OutputReply</v>
</type>
<desc>
Sorts tuples on files. The sort is performed on the
element(s) mentioned in <c>KeyPos</c>. If two tuples
compare equal on one element, the next element according to
<c>KeyPos</c> is compared. The sort is stable.

<c>keysort(KeyPos, FileName)</c> is equivalent to
<c>keysort(KeyPos, [FileName], FileName)</c>.

<c>keysort(KeyPos, Input, Output)</c> is equivalent to
<c>keysort(KeyPos, Input, Output, [])</c>.
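
The following minimal sketch sorts tuples on their second element and shows the effect of stability: two tuples with the same key keep their input order. The module name <c>fs_keysort_sketch</c> and the file names are hypothetical, and the <c>term</c> format and a writable current directory are assumed.

```erlang
-module(fs_keysort_sketch).
-export([demo/0]).

%% Hypothetical file names, used only for this illustration.
-define(IN, "fs_keysort_in.terms").
-define(OUT, "fs_keysort_out.terms").

demo() ->
    %% {b,1} and {a,1} share the key 1 in element 2.
    ok = file:write_file(?IN, "{b,1}.\n{a,1}.\n{c,0}.\n"),
    %% Sort on element 2 only; element 1 is not compared.
    ok = file_sorter:keysort(2, [?IN], ?OUT, [{format, term}]),
    {ok, Sorted} = file:consult(?OUT),
    Sorted.
```

Since the sort is stable, <c>{b,1}</c> is expected to stay ahead of <c>{a,1}</c> in the output, after <c>{c,0}</c>.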

</desc>
</func>
<func>
<name>merge(FileNames, Output) -> Reply</name>
<name>merge(FileNames, Output, Options) -> Reply</name>
<fsummary>Merge terms on files.</fsummary>
<type>
<v>Reply = ok | {error, Reason} | OutputReply</v>
</type>
<desc>
Merges terms on files. Each input file is assumed to be
sorted.

<c>merge(FileNames, Output)</c> is equivalent to
<c>merge(FileNames, Output, [])</c>.
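
A minimal sketch of merging two files that are already sorted follows. The module name <c>fs_merge_sketch</c> and the file names are hypothetical, and the <c>term</c> format and a writable current directory are assumed.

```erlang
-module(fs_merge_sketch).
-export([demo/0]).

%% Hypothetical file names, used only for this illustration.
-define(IN1, "fs_merge_a.terms").
-define(IN2, "fs_merge_b.terms").
-define(OUT, "fs_merge_out.terms").

demo() ->
    %% Each input file must already be sorted.
    ok = file:write_file(?IN1, "{a,1}.\n{c,3}.\n{e,5}.\n"),
    ok = file:write_file(?IN2, "{b,2}.\n{d,4}.\n"),
    ok = file_sorter:merge([?IN1, ?IN2], ?OUT, [{format, term}]),
    {ok, Merged} = file:consult(?OUT),
    Merged.
```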

</desc>
</func>
<func>
<name>keymerge(KeyPos, FileNames, Output) -> Reply</name>
<name>keymerge(KeyPos, FileNames, Output, Options) -> Reply</name>
<fsummary>Merge terms on files by key.</fsummary>
<type>
<v>Reply = ok | {error, Reason} | OutputReply</v>
</type>
<desc>
Merges tuples on files. Each input file is assumed to be
sorted on key(s).

<c>keymerge(KeyPos, FileNames, Output)</c> is equivalent
to <c>keymerge(KeyPos, FileNames, Output, [])</c>.
</desc>
</func>
<func>
<name>check(FileName) -> Reply</name>
<name>check(FileNames, Options) -> Reply</name>
<fsummary>Check whether terms on files are sorted.</fsummary>
<type>
<v>Reply = {ok, [Result]} | {error, Reason}</v>
<v>Result = {FileName, TermPosition, Term}</v>
<v>TermPosition = int() > 1</v>
</type>
<desc>
Checks files for sortedness. If a file is not sorted, the
first out-of-order element is returned. The first term on a
file has position 1.

<c>check(FileName)</c> is equivalent to
<c>check([FileName], [])</c>.
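
A minimal sketch of checking one sorted and one unsorted file follows. The module name <c>fs_check_sketch</c> and the file names are hypothetical, and the <c>term</c> format and a writable current directory are assumed.

```erlang
-module(fs_check_sketch).
-export([demo/0]).

%% Hypothetical file names, used only for this illustration.
-define(SORTED, "fs_check_sorted.terms").
-define(UNSORTED, "fs_check_unsorted.terms").

demo() ->
    ok = file:write_file(?SORTED, "{a,1}.\n{b,2}.\n{c,3}.\n"),
    %% {b,2} follows {c,3}, so this file is not sorted.
    ok = file:write_file(?UNSORTED, "{a,1}.\n{c,3}.\n{b,2}.\n"),
    Options = [{format, term}],
    %% Return both replies so that they can be compared.
    {file_sorter:check([?SORTED], Options),
     file_sorter:check([?UNSORTED], Options)}.
```

The first reply should carry an empty result list, while the second should carry one <c>{FileName, TermPosition, Term}</c> tuple describing the first out-of-order term.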

</desc>
</func>
<func>
<name>keycheck(KeyPos, FileName) -> Reply</name>
<name>keycheck(KeyPos, FileNames, Options) -> Reply</name>
<fsummary>Check whether terms on files are sorted by key.</fsummary>
<type>
<v>Reply = {ok, [Result]} | {error, Reason}</v>
<v>Result = {FileName, TermPosition, Term}</v>
<v>TermPosition = int() > 1</v>
</type>
<desc>
Checks files for sortedness. If a file is not sorted, the
first out-of-order element is returned. The first term on a
file has position 1.

<c>keycheck(KeyPos, FileName)</c> is equivalent
to <c>keycheck(KeyPos, [FileName], [])</c>.
</desc>
</func>
</funcs>

</erlref>
