The 6502 had a couple of unique selling points compared to its predecessor the 6800, and the decimal mode was crucial because it was patent protected. It saves an instruction and a couple of cycles from each byte of decimal arithmetic, and removes the half-carry from the status byte - it also works for both addition and subtraction.
Decimal mode only affects ADC and SBC instructions, and on the NMOS 6502 only usefully sets the C flag. The N, V and Z flags are set, but don't correspond to what you might expect from a 10's complement decimal operation.
Many (software) emulators have decimal mode correct, and many have it incorrect or missing. The same is true for various re-implemented 6502 cores. Because the CMOS 6502 and later parts set the flags differently from the NMOS 6502, correctness can only be judged relative to a specific part.
Bruce Clark's tutorial contains a test program which can test all the flags (for the various CPU models) and will report the first failing case. Using this, we can collect some specific 'difficult' cases for use on a slow model, or for a rapid test of new code, or for illustration of the 6502 datapath in action. Some of the following tests are now found in the py65 test suite
We need a list of interesting signals to probe to observe the decimal mode adjustments. (The presently released JSSim doesn't have C34 named, but it will on next update)
The two operands, and the carry in, are added as a pair of nibbles. The carry-out from bit3 is adjusted in decimal mode, but only for ADC. So the ALU is not a binary byte-wide ALU with a decimal adjustment, it is a pair of binary nibble ALUs with a decimal adjustment. In the tests, we don't specifically need to test that carry-in is used (except to prove that carry-out is changing the carry bit, if we have that freedom)
(For other test suites, see 6502TestPrograms)
Tests for ADC
- 00 + 00 and C=0 gives 00 and N=0 V=0 Z=1 C=0 (simulate)
- 79 + 00 and C=1 gives 80 and N=1 V=1 Z=0 C=0 (simulate)
- 24 + 56 and C=0 gives 80 and N=1 V=1 Z=0 C=0 (simulate)
- 93 + 82 and C=0 gives 75 and N=0 V=1 Z=0 C=1 (simulate)
- 89 + 76 and C=0 gives 55 and N=0 V=0 Z=0 C=1 (simulate)
- 89 + 76 and C=1 gives 56 and N=0 V=0 Z=1 C=1 (simulate)
- 80 + f0 and C=0 gives d0 and N=0 V=1 Z=0 C=1 (simulate)
- 80 + fa and C=0 gives e0 and N=1 V=0 Z=0 C=1 (simulate)
- 2f + 4f and C=0 gives 74 and N=0 V=0 Z=0 C=0 (simulate)
- 6f + 00 and C=1 gives 76 and N=0 V=0 Z=0 C=0 (simulate)
Tests for SBC
- 00 - 00 and C=0 gives 99 and N=1 V=0 Z=0 C=0 (simulate)
- 00 - 00 and C=1 gives 00 and N=0 V=0 Z=1 C=1 (simulate)
- 00 - 01 and C=1 gives 99 and N=1 V=0 Z=0 C=0 (simulate)
- 0a - 00 and C=1 gives 0a and N=0 V=0 Z=0 C=1 (simulate)
- 0b - 00 and C=0 gives 0a and N=0 V=0 Z=0 C=1 (simulate)
- 9a - 00 and C=1 gives 9a and N=1 V=0 Z=0 C=1 (simulate)
- 9b - 00 and C=0 gives 9a and N=1 V=0 Z=0 C=1 (simulate)
One form of test program sets all the input flags using PLP:
lda #$c8 pha lda #$00 plp adc #$00 nop
and to calculate what that initial value of PLP should be, we can use a bit more code
php pla eor #$c3 // #$c2 if we don't want to invert the carry nop
Decimal mode and the NES' RP2A03G
The CPU in the NES' RP2A03G does not implement decimal mode for ADC and SBC operations, but it does correctly handle the setting and clearing of the D flag.
In this blog post by Nathan Altice, Brian Bagnall’s "On the Edge: The Spectacular Rise and Fall of Commodore (2006)" is quoted:
[Commodore 64 programmer] Robert Russell investigated the NES, along with one of the original 6502 engineers, Will Mathis. “I remember we had the chip designer of the 6502,” recalls Russell. “He scraped the [NES] chip down to the die and took pictures.”
The excavation amazed Russell. “The Nintendo core processor was a 6502 designed with the patented technology scraped off,” says Russell. “We actually skimmed off the top of the chip inside of it to see what it was, and it was exactly a 6502. We looked at where we had the patents and they had gone in and deleted the circuitry where our patents were.”
With visual6502 and images from Quietust's investigation of the 2A03 we can see that a small number of changes, only to the polysilicon mask, disable the decimal adjustment by removing 5 transistors. When poly shapes are deleted, the former source and drain of transistor become contiguous, so the effect is of shorting the transistor, or making it permanently on. (These are pulldown transistors, and it's normal for them to be on, although they would typically have a 10k resistance. Shorting them will cause some additional power dissipation from the corresponding pullup but presumably insigificant compared to the thousands of other pullups which will be active at any give time.)
The first note of the difference is this odd contact cut which has no surrounding poly or active: see this image - which turns out to be due to the removal of the t1329 transistor. It's one of two transistors normally used by the "dpc22_#DSA" signal as a pulldown to effect decimal adjust during subtraction. The other is t3212 which is just off the top of the two images linked above.
In the case of t2556 the control line runs through the transistor - but still the poly is removed locally with a minimal change. That leaves some floating poly, but as the other two transistors don't exist any more, it's irrelevant.
With these 5 transistors removed, there was no need to change the decode ROM and no need to change the status register.