Pages

Sunday, December 23, 2007

NVIDIA Issues on multi-core/SMP CPU continues

[BradS@xps-m1710 ~]$ sudo tail -100f /var/log/messages | grep -i nvrm
Dec 23 18:11:30 xps-m1710 kernel: NVRM: loading NVIDIA UNIX x86_64 Kernel Module 169.07 Thu Dec 13 18:34:01 PST 2007
Dec 23 18:15:01 xps-m1710 kernel: NVRM: loading NVIDIA UNIX x86_64 Kernel Module 169.07 Thu Dec 13 18:34:01 PST 2007
Dec 23 19:30:48 xps-m1710 kernel: NVRM: Xid (0001:00): 6, PE0000 0280 00002000 0000ab90 00000000 ccafa493
Dec 23 19:30:48 xps-m1710 kernel: NVRM: Xid (0001:00): 36, L1 -> L0


Continue to have issues, now using the x86_64 169.07 NVIDIA driver on 2.6.23.9-85.fc8 kernel

This time watching the Core Temperature, I had a sudden lock up when it peaked up at 62C, then the screen did something it never did before, it unfrooze ... which is new, usually a lock up meant reboot, but this time within 5 seconds it unfrooze ... right now I am monitoring the Core Tem and turned off the Thermal Monitor on the NVIDIA-settings wizard, let's see what happens this time

If you continue to have issues with multicore cpu and nvidia drivers look here:



Closed Thread
Thread Tools Search this Thread
Old 10-21-05, 10:01 AM #1
zander
NVIDIA Corporation

zander's Avatar

Join Date: Aug 2002
Posts: 2,904
Exclamation If you have a stability problem, PLEASE read this first

The NVIDIA Linux graphics drivers rely on the Linux kernel's change_page_attr() interface function to change the kernel mappings' cache attributes for system memory pages used in DMA transfers. It is crucial for reliable operation that this interface is working correctly. Unfortunately, many presently deployed Linux 2.4 and Linux 2.6 kernels have known problems in their implementations of this interface.

Please check the following before reporting stability problems:
  • If you are seeing severe stability problems and you are using a Linux 2.6 SMP kernel on a system with multiple processors (or processor cores) in combination with more than one GPU, please search the output of `dmesg` for the presence of the message below after the system has just been started:
    PCI: Using MMCONFIG

    If this message is present, please boot the system with the pci=nommconf kernel parameter and check if the stability problems continue to reproduce.

  • If your system is equipped with a dual-core processor, booting with the idle=poll and/or maxcpus=1 kernel parameters may improve reliability with some Linux kernels.

  • If you are using an AGP graphics card, please test setting NvAGP to 0 in xorg.conf. If this eliminates the instability, then you are experiencing a problem outside of the NVIDIA X driver, either in the motherboard BIOS, kernel, kernel AGP driver, or possibly in the motherboard itself.

  • If you are using a Linux/x86-64 2.6 kernel and see the warning message below during the installation and/or in the system log file(s) when the NVIDIA kernel module is loaded, please upgrade your kernel to Linux 2.6.11 or a more recent stable Linux 2.6 release. Linux/x86-64 2.6 kernels < Linux 2.6.11 have an accounting bug in their implementation of the change_page_attr() interface that can trigger a kernel BUG(). The 1.0-7174 and 1.0-7676 NVIDIA Linux/x86-64 graphics driver releases work around this problem by disabling use of the change_page_attr() interface on these kernels.

    NVRM: Your Linux kernel has known problems in its implementation of
    NVRM: the change_page_attr() kernel interface.
    NVRM:
    NVRM: The NVIDIA graphics driver will attempt to work around these
    NVRM: problems, but system stability may be adversely affected.
    NVRM: It is recommended that you update to Linux 2.6.11 (or a newer
    NVRM: Linux kernel release).

  • If you are using the 1.0-7676 NVIDIA Linux/x86-64 graphics driver release and a Linux/x86-64 2.6 kernel < Linux 2.6.11, but know that your kernel has a working implementation of the change_page_attr() kernel interface function (e.g. RHEL4's Linux 2.6.9-22.3.EL beta kernel), you can pass the NVIDIA kernel module the NVreg_UseCPA=1 kernel module option to re-enable use of this interface. The 1.0-8174 and more recent NVIDIA Linux graphics driver releases will automatically detect affected kernels and enable use of the change_page_attr() interface when it is safe to use.

  • If you are using a Linux/x86-64 kernel >= Linux 2.6.11 and < Linux 2.6.14, please also see http://www.nvnews.net/vbulletin/showthread.php?t=57990 for important information on updating the 1.0-7174 or 1.0-7676 NVIDIA Linux/x86-64 graphics driver releases to work around a kernel bug in the global_flush_tlb() kernel interface function. If you are using the 1.0-8174 or a more recent NVIDIA Linux graphics driver release, no manual update is necessary.

  • If you see warning messages similar to those below in the system log file(s) when starting the X server or OpenGL applications, then please update your kernel. If the problem persists with the latest distribution kernel, then please contact your distributor and submit a bug report to linux-bugs@nvidia.com. Please also read the section on Cache Aliasing in the NVIDIA Linux graphics driver README.

    NVRM: bad caching on address 0x100362d0000: actual 0x163 != expected 0x173
    NVRM: bad caching on address 0x100362d1000: actual 0x163 != expected 0x173
    NVRM: bad caching on address 0x1003be7c000: actual 0x163 != expected 0x173
    NVRM: bad caching on address 0x1003be7d000: actual 0x163 != expected 0x173


  • For any problem that involves instability, you should always verify that you are using the most recently released BIOS for the motherboard.

Please note: if you have checked the above items and continue to experience stability problems, please submit a bug report (see http://www.nvnews.net/vbulletin/showthread.php?t=46678 for information on how to submit bug reports).

Thanks!
zander is offline
Closed Thread



Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Forum Jump


All times are GMT -4. The time now is 08:34 PM.


Powered by vBulletin Version 3.5.3
Copyright ©2000 - 2007, Jelsoft Enterprises Ltd.

1 comment:

Blogger said...

Need To Increase Your ClickBank Banner Traffic And Commissions?

Bannerizer made it easy for you to promote ClickBank products by banners, simply visit Bannerizer, and grab the banner codes for your chosen ClickBank products or use the Universal ClickBank Banner Rotator to promote all of the ClickBank products.