I/O issues using flashcache, broken system files #193
Comments
Are there any flashcache messages in the messages file around the time these issues occur? What version of flashcache are you running? Are you running master, top of the tree?
cat /proc/flashcache/md0+scsi-3600605b0057db8401a4b346b2d4d3838-part2/flashcache_stats
I mean the write_invalidates/read_invalidates errors. We are using the master version, flashcache-3.1.1.
Are there flashcache messages in /var/log/messages that show IO errors from either the flash or the disk? Flashcache is returning EIO (or at least the application is reporting EIO). Is either the underlying disk or the flash returning EIO? Can you paste the output from dmsetup status and dmsetup table? Write Invalidate is not an error.
We had the same errors on a setup with HDD storage using a 4k block size. We tried different settings when creating the cache, different block sizes, etc., but the problem was solved only by moving back to 512b sectors on the HDD.
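For anyone comparing sector sizes between the HDD and the SSD, here is a minimal sketch that reads the values the Linux kernel exposes under /sys/block/<device>/queue/. The helper names and the sysfs_root parameter are illustrative assumptions, not part of flashcache, and the device names in the usage comment are hypothetical.

```python
from pathlib import Path

# Files the Linux block layer exposes under /sys/block/<device>/queue/.
SIZE_FILES = ("logical_block_size", "physical_block_size")


def block_sizes(device, sysfs_root="/sys/block"):
    """Return the logical and physical block sizes (in bytes) of one device."""
    queue = Path(sysfs_root) / device / "queue"
    return {name: int((queue / name).read_text().strip()) for name in SIZE_FILES}


def sizes_match(devices, sysfs_root="/sys/block"):
    """Return (True, details) if all devices share one logical block size."""
    seen = {dev: block_sizes(dev, sysfs_root) for dev in devices}
    logical = {sizes["logical_block_size"] for sizes in seen.values()}
    return len(logical) == 1, seen


# Usage (hypothetical device names; use whole-disk names, not partitions):
#   ok, details = sizes_match(["sda", "sdb"])
#   print(ok, details)
```

This only reports what the kernel sees; it does not say whether a mismatch is actually the cause of the errors discussed in this thread.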
Kaydannik, useful information, thanks, but our node is already in production and we cannot change the block size; also, everything was OK in older versions. Is it possible to fix this issue somehow in a new version of flashcache? Or can we format the SSDs with the same block size as the HDDs?
We checked once again: our block sizes are the same, all equal to 512. We do not have any flashcache errors in the logs. The requested logs are attached:
I have a similar setup, with flashcache running under KVM with Windows templates. With Linux templates the same settings work fine. The issue is that system files randomly become corrupted, in write-through mode. Simply turning the cache off solves the issue. Flashcache reports no errors. I am including information below. I will do a more detailed test soon with virtio on/off and different options in KVM/libvirt. But the same KVM settings do work for Linux guests.
version flashcache-3.1.1
status
table
stats
disk_read_errors=0 disk_write_errors=0 ssd_read_errors=0 ssd_write_errors=0 memory_alloc_errors=0
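The error counters quoted above are whitespace-separated key=value tokens, so they are easy to check automatically. Below is a rough sketch, assuming that format holds for whatever counter text you paste in; the function names are made up for illustration and are not part of flashcache.

```python
def parse_counters(text):
    """Parse whitespace-separated key=value tokens into a dict of ints.

    Tokens without '=' or with a non-numeric value are skipped.
    """
    counters = {}
    for token in text.split():
        key, sep, value = token.partition("=")
        if sep and value.isdigit():
            counters[key] = int(value)
    return counters


def nonzero_errors(counters):
    """Keep only the counters whose name ends in '_errors' and are non-zero."""
    return {k: v for k, v in counters.items() if k.endswith("_errors") and v != 0}


# The counter line reported in this comment.
sample = (
    "disk_read_errors=0 disk_write_errors=0 ssd_read_errors=0 "
    "ssd_write_errors=0 memory_alloc_errors=0"
)
print(nonzero_errors(parse_counters(sample)))  # prints {}
```

An empty result matches what is reported here: flashcache itself counts no disk, SSD, or allocation errors even though the guests see corruption.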
A little update on my post above. I can consistently reproduce the errors with flashcache on and Windows guests, but I also appear to have a solution. Setting a password or running Windows updates results in errors like: http://i.is.cc/1yPQJu2r.png (it goes through on the second or third attempt). My normal setup for Linux and Windows servers in the XML file is
<driver name='qemu' type='raw' cache='none'/>
If I change this to
<driver name='qemu' type='raw' cache='writeback' threads='native'/>
I no longer see any errors in Windows guests with flashcache.
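To see which cache mode each guest disk currently uses, one option is to read the cache attribute of each disk's driver element in the libvirt domain XML. The following is only a sketch using Python's standard xml.etree module; the function name is an assumption, and the sample domain (including the vda target and the source path) is hypothetical apart from the driver line quoted in this comment.

```python
import xml.etree.ElementTree as ET


def disk_cache_modes(domain_xml):
    """Map each disk target (e.g. 'vda') to its driver cache attribute.

    Disks whose <driver> element has no cache attribute are reported
    as 'default'.
    """
    root = ET.fromstring(domain_xml)
    modes = {}
    for disk in root.iter("disk"):
        driver = disk.find("driver")
        target = disk.find("target")
        if driver is None or target is None:
            continue
        modes[target.get("dev")] = driver.get("cache", "default")
    return modes


# Hypothetical minimal domain; only the <driver> line comes from this thread.
sample_domain = """
<domain type='kvm'>
  <devices>
    <disk type='block' device='disk'>
      <driver name='qemu' type='raw' cache='none'/>
      <source dev='/dev/mapper/example'/>
      <target dev='vda' bus='virtio'/>
    </disk>
  </devices>
</domain>
"""
print(disk_cache_modes(sample_domain))  # prints {'vda': 'none'}
```

The domain XML could come from a saved file or from whatever tool you use to dump the guest definition; the sketch does not modify anything.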
Thank you for reporting this issue; we appreciate your patience. We've notified the core team for an update on this issue. We're looking for a response within the next 30 days or the issue may be closed.
Same issue here: it can be reproduced easily.
Hello, we are using 2x480GB SSDs in RAID0 as a cache for our VPS nodes, but with the latest versions we have a lot of issues. Some examples from the logs:
[ 42.722313] Buffer I/O error on device xvda1, logical block 309394
[ 42.722317] lost page write due to I/O error on xvda1
[ 42.730988] end_request: I/O error, dev xvda1, sector 2475344
[ 42.739014] end_request: I/O error, dev xvda1, sector 2475600
The issue can be resolved by rebooting the node.
Also, after some time running on a node with flashcache, Windows VPSes end up with broken system files and do not boot correctly, which is a very big problem.
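When collecting evidence for a report like this, it can help to group the failing sectors by device. Here is a small sketch that extracts them from end_request lines in the format shown in the log excerpt above; the regular expression only covers that exact message wording, and the function name is an illustrative assumption.

```python
import re
from collections import defaultdict

# Matches lines such as:
#   end_request: I/O error, dev xvda1, sector 2475344
END_REQUEST = re.compile(r"end_request: I/O error, dev (\S+), sector (\d+)")


def failed_sectors(log_text):
    """Return {device: [sector, ...]} for every end_request I/O error line."""
    by_device = defaultdict(list)
    for line in log_text.splitlines():
        match = END_REQUEST.search(line)
        if match:
            by_device[match.group(1)].append(int(match.group(2)))
    return dict(by_device)


# The log lines quoted in this report.
log = """\
[ 42.722313] Buffer I/O error on device xvda1, logical block 309394
[ 42.722317] lost page write due to I/O error on xvda1
[ 42.730988] end_request: I/O error, dev xvda1, sector 2475344
[ 42.739014] end_request: I/O error, dev xvda1, sector 2475600
"""
print(failed_sectors(log))  # prints {'xvda1': [2475344, 2475600]}
```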