Commit 3fa0f9f
committed
Optimize BAM tag decoding: single-pass string reads and skip redundant validation.
Two targeted optimizations to the BAM tag decoding path:
1. BinaryTagCodec.readNullTerminatedString: replace the double-pass
approach (scan forward for null, reset, re-read) with a single-pass
scan over the backing byte array, eliminating the intermediate byte[]
allocation per string tag.
2. SAMBinaryTagAndValue/SAMBinaryTagAndUnsignedArrayValue: add
package-private constructors that skip isAllowedAttributeValue()
validation, used from BinaryTagCodec.readTags() where the value
types are known to be valid from the BAM type codes.
Profiling with eager decode showed tag decoding at 18.2% of CPU.
Benchmarking on a 3.9GB BAM (52M records) showed a ~3% improvement
in the eager-decode path (40.0s -> 38.9s) with no regression in the
lazy path.1 parent f60723e commit 3fa0f9f
File tree
3 files changed
+25
-18
lines changed- src/main/java/htsjdk/samtools
3 files changed
+25
-18
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
279 | 279 | | |
280 | 280 | | |
281 | 281 | | |
282 | | - | |
| 282 | + | |
283 | 283 | | |
284 | 284 | | |
285 | | - | |
286 | | - | |
| 285 | + | |
| 286 | + | |
287 | 287 | | |
288 | 288 | | |
289 | 289 | | |
| |||
294 | 294 | | |
295 | 295 | | |
296 | 296 | | |
297 | | - | |
| 297 | + | |
298 | 298 | | |
299 | 299 | | |
300 | 300 | | |
| |||
407 | 407 | | |
408 | 408 | | |
409 | 409 | | |
410 | | - | |
411 | | - | |
412 | | - | |
413 | | - | |
414 | | - | |
415 | | - | |
416 | | - | |
417 | | - | |
418 | | - | |
419 | | - | |
420 | | - | |
421 | | - | |
422 | | - | |
423 | | - | |
| 410 | + | |
| 411 | + | |
| 412 | + | |
| 413 | + | |
| 414 | + | |
| 415 | + | |
| 416 | + | |
424 | 417 | | |
425 | 418 | | |
Lines changed: 5 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
39 | 39 | | |
40 | 40 | | |
41 | 41 | | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
42 | 47 | | |
43 | 48 | | |
44 | 49 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
65 | 65 | | |
66 | 66 | | |
67 | 67 | | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
68 | 77 | | |
69 | 78 | | |
70 | 79 | | |
| |||
0 commit comments