Some numbers: 40GiB - 40GB = 2813MiB. At 512KiB per block, that works out to ~5626 blocks, roughly the number of reallocated sectors before the spare area would reach 0 on the Intel 320 drive.
The 40GB model has 48GiB of NAND (20% more), while the 80GB and 160GB models have 88GiB and 176GiB. If roughly 10% extra is needed for parity, we can assume the drive has about 4GB of additional spare area (plus or minus 1-2GB, since the parity data usage is unknown) that is being used right now. That would mean maybe 8000-10000 more spare blocks. It is also possible that another complete die in one package is wearing out, so the number of reallocated sectors might stabilize again around 8192.
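For anyone who wants to redo the spare-area arithmetic, here's a quick sketch (the raw NAND size is my assumption, not Intel's spec):

```python
# Rough sketch of the spare-area math above (assumed figures, not Intel's spec).
GiB = 1024**3
GB  = 1000**3
MiB = 1024**2
KiB = 1024

user_capacity_gb = 40   # advertised capacity (decimal GB)
raw_nand_gib     = 40   # assumed NAND before counting the extra ~20% die

binary_minus_decimal = raw_nand_gib * GiB - user_capacity_gb * GB
print(binary_minus_decimal / MiB)           # ~2813 MiB of "free" spare area
print(binary_minus_decimal // (512 * KiB))  # ~5626 blocks of 512 KiB
```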
Not sure, but it looks like something is being misinterpreted: the data is written sequentially, and the only part that is not sequential is the small part at the end of the loop, which is limited to 100MiB per loop.
Kingston SSDNow 40GB (X25-V)
398.93TB Host writes
Reallocated sectors : 11
MD5 OK
33.36MiB/s on avg (~46 hours)
--
Corsair Force 3 120GB
01 94/50 (Raw read error rate)
05 2 (Retired Block count)
B1 57 (Wear range delta)
E6 100 (Life curve status)
E7 53 (SSD Life left)
E9 183160 (Raw writes)
F1 243906 (Host writes)
104.17MiB/s on avg (~144 hours)
power on hours : 722
144 hours = 6 days; it has written > 51TiB in a single session, which is pretty good.
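For the curious, the > 51TiB figure checks out if you assume the average speed held for the whole session:

```python
# Quick sanity check of the "> 51 TiB in one session" figure, assuming a
# constant 104.17 MiB/s over the full 144 hours.
avg_mib_s = 104.17
hours = 144
written_tib = avg_mib_s * 3600 * hours / 1024**2
print(f"{written_tib:.1f} TiB")   # ~51.5 TiB
```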
I'll restart the computer and re-activate C-States shortly.
Anvil,
Are you still planning on adding user-adjustable parameters for the end of the loop?
I put a different Windows installation on the Mushkin. This installation was larger than the previous one on there, and as a result far fewer files were being generated per loop.
The last four crashes I've encountered have all occurred during the file-deletion phase. I reduced the free space to bring the number of files back on par with what was being generated before, to see if this helps. I'm grasping at straws here, but below 10,000 files I think the pause isn't long enough for the Mushkin. I'm testing it with more files per loop now to see what happens.
Mushkin Chronos Deluxe 60 Update, Day 25
05 2 (Retired Block Count)
B1 16 (Wear Range Delta)
F1 234538 (Host Writes)
E9 180877 (NAND Writes)
E6 100 (Life Curve)
E7 10 (Life Left)
129.42MB/s on avg
Intel RST drivers, Asus M4g-z
574 Hours Work (28hrs since the last update)
Time 23 days 22 hours
6 GiB Minimum Free Space
I think the sudden increase in Wear Range Delta is attributable to the decrease in free space I'm using to achieve more than 10,000 files per loop.
The 320 is still not dead? Has the Samsung arrived yet?
Yes, you can adjust the pause. The delay while deleting files is still not configurable by the end user, but it scales at 500ms per 500 files.
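A minimal sketch of how that 500ms-per-500-files pacing scales (illustrative only, not the actual ASU code; the function name and the round-up are simplifications):

```python
# Illustrative pacing: the delete pause grows with the number of files in the loop.
def delete_pause_ms(file_count: int, ms_per_chunk: int = 500, chunk: int = 500) -> int:
    chunks = (file_count + chunk - 1) // chunk   # round up to whole 500-file chunks
    return chunks * ms_per_chunk

print(delete_pause_ms(10_000))   # 10,000 files -> 10,000 ms pause
print(delete_pause_ms(4_200))    #  4,200 files ->  4,500 ms pause
```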
Both are now running on the ASRock Z68, C States Enabled.
Kingston SSDNow 40GB (X25-V)
400.33TB Host writes
Reallocated sectors : 12 (1 up)
MD5 OK
36.63MiB/s on avg (~11 hours)
--
Corsair Force 3 120GB
01 88/50 (Raw read error rate)
05 2 (Retired Block count)
B1 59 (Wear range delta)
E6 100 (Life curve status)
E7 52 (SSD Life left)
E9 186411 (Raw writes)
F1 248238 (Host writes)
106.99MiB/s on avg (~11 hours)
power on hours : 735
Wear Range Delta is as high as it was before it started decreasing; let's see what happens later today.
M225->Vertex Turbo 64GB Update:
521.28 TiB (573.15 TB) total
1357.30 hours
10602 Raw Wear
118.38 MB/s avg for the last 63.81 hours (on W7 x64)
MD5 OK
C4 - Erase Failure Block Count (Realloc Sectors) went from 8 to 9.
(1=Bank 6/Block 2406; 2=Bank 3/Block 3925; 3=Bank 0/Block 1766; 4=Bank 0/Block 829; 5=Bank 4/Block 3191; 6=Bank 7/Block 937; 7=Bank 7/Block 1980; 8=Bank 7/Block 442; 9=Bank 7/Block 700)
Bumping down free space to generate over 10K files per loop seems to have done the trick -- so far.
No more crashing on file deletes... bonus.
320 still going strong. The reallocated sectors are still quickly rising but the SSD seems fine...
I just got the Samsung 5 minutes ago. First I have to fix the broken power connector. I probably won't get to it today, but I should get it done tomorrow. Johnw - what read tests do you want done?
After the read tests, I will attempt to write a pattern onto it for mapping the translation algorithm (which should take forever considering its state), then pop the NAND off and read it directly.
One_Hertz,
Do you have to make your own Flash Translation Layer for each drive?
First, see if you can read anything on the filesystem at all. Just before I broke the SATA power connector, I was unable to get the BIOS to even recognize the SSD. It reached write exhaustion on 2011-Aug-20, but I was still able to read files from it. Then I left it unpowered for a month and tried to read it again, but the BIOS would not recognize the drive. I fiddled with it a little with no luck, and then the SATA power connector broke while I was trying to get it recognized on another computer.
The idea was originally to check the MD5 of the ~40GB file on the SSD every month for a year, since consumer SSDs are supposed to be able to retain data, unpowered, for a year after write exhaustion. It seemed like the Samsung did not even last a month, but it would be good for you to double-check. If you do manage to mount the filesystem read-only, then please compute an MD5 checksum on the ~40GB file.
If you are unable to mount the filesystem at all, then you can proceed with whatever tests you would like to try.
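If the filesystem does mount read-only, something like this chunked MD5 will handle a file that size without eating RAM (the mount point and filename are placeholders):

```python
# Compute MD5 of a very large file in fixed-size chunks.
import hashlib

def md5_of_file(path: str, chunk_size: int = 8 * 1024 * 1024) -> str:
    h = hashlib.md5()
    with open(path, "rb") as f:
        for block in iter(lambda: f.read(chunk_size), b""):
            h.update(block)
    return h.hexdigest()

print(md5_of_file("/mnt/samsung_ro/static_40gb.bin"))  # placeholder path
```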
Hi
First of all, thank you for doing this great project - I've been following this thread every day for weeks now with great interest.
Having an SF-22xx drive myself, I've been especially interested in this aspect of the thread. I've owned a Force GT 120GB for a bit more than a month now and I've only had two crashes where the drive disappeared from the BIOS. While not much compared to other folks, it's still annoying, and maybe even more annoying knowing I've got a... fragile... piece of hardware with the potential to cause trouble when it feels like doing so.
So, may we have a recap on the SF issue and can you say anything concrete on the matter in terms of causes/workarounds at this time?
Also, I don't mind helping out if you have some specific BIOS settings/workloads/etc. to try out if it brings the world closer to solving the infamous SF problem. This is my only PC though and it's not running 24/7, so I may not have the same resources, but I'll do what I can if needed.
What I have observed around the SF bug myself is that it has only happened during an overclock with an OFFSET vcore while running the BOINC client with the specific settings of 100% CPUs used and 60% CPU time. This configuration results in an erratic load on the processor, going literally from 0% to 100% and back down to 0% in under a second, with the vcore being thrown around like mad. The first time it crashed within 2 hours of this load, and the second time within 10 hours. C3/C6 states were disabled, although C1E and EIST were enabled.
At the moment I'm testing the same CPU workload but with a static vcore.
Folmer
Take a look here; this is where they have been working on those problems: http://www.xtremesystems.org/forums/...nd-workarounds
Really interesting to see the results of the 320 and the Samsung. Can't wait. How long will the M4 last now? Any estimates? Maybe 1 PB?
Based on what happened to your drive (and what appears to be happening to One_Hertz's drive), I suspect that is based on MWI exhaustion, not the physical exhaustion of the NAND. It's a grey area, however, that manufacturers should confirm.
I am interested to see if/how well the Samsung managed static data. One_Hertz, is this something you can determine?
@ Anvil, I monitored "normal" activity today using DiskMon, and when I plugged in the stats it came out as 37% random/63% sequential for writes, with an average write size of 64KB. Still not sure why ASU came out as mostly random. Will check into it more later.
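For reference, this is roughly how the trace was bucketed into sequential vs random: a write counts as sequential if it starts exactly where the previous write ended. The record layout is illustrative; DiskMon's actual export format may differ.

```python
# Classify a single-drive write trace into sequential vs random.
def classify_writes(events):
    """events: iterable of (offset_bytes, length_bytes) write records in time order."""
    seq = rand = 0
    expected_next = None
    for offset, length in events:
        if expected_next is not None and offset == expected_next:
            seq += 1
        else:
            rand += 1
        expected_next = offset + length
    total = seq + rand
    return seq / total, rand / total

seq_pct, rand_pct = classify_writes([(0, 65536), (65536, 65536), (10_000_000, 4096)])
print(f"{seq_pct:.0%} sequential / {rand_pct:.0%} random")
```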
FoLmEr,
There is a new thread dedicated to this very issue here:
http://www.xtremesystems.org/forums/...=1#post4975129
Could you post your entire system specs there?
And
Are you using the GT as a system drive?
Anything relevant, and perhaps not so relevant, could be useful (one day).
Ideally, if the SSD is accessible through the controller, I would write the logical block address of each sector as that sector's content. Then I would read all the NAND chips directly and have all the data, except it would be in the order it is actually stored on the flash. The contents of the sectors will allow me to take a very basic look at how the wear-leveling algorithm works. Most likely it will be such a complex mess that I won't be able to figure much out myself, but who knows... it did fail first. I mainly just want to take it apart to see how easily the NAND itself could be read. It might read just fine, or it might be a sea of ECC errors. I will see.
I'll try this. Hopefully it is still accessible.
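A rough sketch of that LBA-stamping idea, assuming the drive still accepts raw writes (device path and sector count are placeholders; this wipes whatever is on the drive, and a real run would need proper alignment/direct-I/O handling):

```python
# Stamp every 512-byte sector with its own LBA so a raw NAND dump later
# reveals which logical address the FTL mapped to each physical page.
import struct

SECTOR = 512

def stamp_sectors(dev_path: str, total_sectors: int) -> None:
    with open(dev_path, "rb+") as dev:
        for lba in range(total_sectors):
            payload = struct.pack("<Q", lba) * (SECTOR // 8)  # repeat the LBA to fill the sector
            dev.seek(lba * SECTOR)
            dev.write(payload)

# stamp_sectors("/dev/sdX", 78_125_000)   # ~40 GB drive; only run on a sacrificial SSD
```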
Cheers for the heads-up on the SF thread. I'll keep my SF business in there.
Ao1,
Maybe the 3xnm flash in the Samsung is just really consistent and got worn out mostly at the same time. What do you mean by MWI exhaustion vs. wearing the flash out?
What I was trying to say was that MWI is based on the theoretical P/E cycles for NAND. What this thread is showing is that actual P/E capability is significantly higher. The Samsung looked like it kept writing until the P/E capability was physically depleted. The one-year data retention duration, however, is most likely based on expiry of the theoretical P/E cycles rather than on physical depletion.
Regarding wear-out, I'm just curious to see if/how well the Samsung rotated the static data. The high sustained sequential write speeds might be helped by low controller overhead in the form of W/A & WL.
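To illustrate the distinction, an MWI-style "life left" value is just a countdown against the rated cycle count, something like this back-of-the-envelope (the 5000-cycle rating and the GiB-per-count assumption for E9 are mine, not a spec):

```python
# Life-left as a countdown of average P/E cycles used vs the *rated* cycle count,
# not the point where the NAND physically stops erasing.
def media_wearout_indicator(nand_writes_gib: float, capacity_gib: float,
                            rated_pe_cycles: int = 5000) -> float:
    avg_pe_used = nand_writes_gib / capacity_gib
    return max(0.0, 100.0 * (1 - avg_pe_used / rated_pe_cycles))

# e.g. ~180,877 GiB of NAND writes on ~64 GiB of flash rated for 5000 cycles:
print(media_wearout_indicator(180_877, 64))   # ~43% left by the rated-cycle math
```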
Kingston SSDNow 40GB (X25-V)
401.63TB Host writes
Reallocated sectors : 12
MD5 OK
34.83MiB/s on avg (~22 hours)
--
Corsair Force 3 120GB
01 90/50 (Raw read error rate)
05 2 (Retired Block count)
B1 61 (Wear range delta)
E6 100 (Life curve status)
E7 51 (SSD Life left)
E9 189633 (Raw writes)
F1 252531 (Host writes)
107.04MiB/s on avg (~22 hours)
power on hours : 747
B1 is still climbing; I expected it to stay in the 50s.