By jerome on 06/10/2005 - 11:31
Hello,
Thanks to anyone who takes the time to read this long message.
As part of a PHP/PostgreSQL web infrastructure (about 250,000 pages per day), I installed a PostgreSQL 7.4.2 server on SUSE Enterprise 9 (kernel 2.6.5-7.97-smp x86_64) on a dual-Opteron machine (2 Opteron 240 CPUs at 1.4 GHz, 1 MB cache) with 8 GB of RAM.
My problem is that the system freezes more or less regularly (about once every 15 days), forcing a service shutdown that is always tricky.
================================================================================================================
System parameters:
/proc/sys/kernel/shmmax: 1,073,741,824
/proc/sys/kernel/shmall: 2,097,152 (default value)
/proc/sys/kernel/shmmni: 4,096 (default value)
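As a rough sanity check on these values, the short Python sketch below converts them to MiB and compares them with the shared_buffers setting from the configuration file in this post. It assumes 4 KB kernel pages (shmall is counted in pages, shmmax in bytes) and 8 KB PostgreSQL buffers; both sizes are assumptions about this machine, and the function name is purely illustrative.

```python
# Values quoted in this post.
SHMMAX_BYTES = 1_073_741_824   # /proc/sys/kernel/shmmax, in bytes
SHMALL_PAGES = 2_097_152       # /proc/sys/kernel/shmall, in pages
SHARED_BUFFERS = 4096          # shared_buffers from postgresql.conf

# Assumptions, not taken from the post: page and buffer sizes.
KERNEL_PAGE_BYTES = 4096       # typical x86_64 Linux page size
PG_BUFFER_BYTES = 8192         # "8KB each" per the config file comment

MIB = 1024 * 1024


def shm_summary():
    """Return the three limits expressed in MiB for easy comparison."""
    return {
        "shmmax_mib": SHMMAX_BYTES // MIB,
        "shmall_mib": SHMALL_PAGES * KERNEL_PAGE_BYTES // MIB,
        "shared_buffers_mib": SHARED_BUFFERS * PG_BUFFER_BYTES // MIB,
    }


if __name__ == "__main__":
    for name, value in shm_summary().items():
        print(f"{name}: {value} MiB")
```

Under these assumptions the buffer pool is far below both kernel limits, so the shared memory settings alone do not look like an obvious constraint.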
================================================================================================================
My configuration file is as follows:
# -----------------------------
# PostgreSQL configuration file
# -----------------------------
#
# This file consists of lines of the form:
#
# name = value
#
# (The '=' is optional.) White space may be used. Comments are introduced
# with '#' anywhere on a line. The complete list of option names and
# allowed values can be found in the PostgreSQL documentation. The
# commented-out settings shown in this file represent the default values.
#
# Any option can also be given as a command line switch to the
# postmaster, e.g. 'postmaster -c log_connections=on'. Some options
# can be changed at run-time with the 'SET' SQL command.
#
# This file is read on postmaster startup and when the postmaster
# receives a SIGHUP. If you edit the file on a running system, you have
# to SIGHUP the postmaster for the changes to take effect, or use
# "pg_ctl reload".
#---------------------------------------------------------------------------
# CONNECTIONS AND AUTHENTICATION
#---------------------------------------------------------------------------
# - Connection Settings -
tcpip_socket = true
max_connections = 1024
# note: increasing max_connections costs about 500 bytes of shared
# memory per connection slot, in addition to costs from shared_buffers
# and max_locks_per_transaction.
#superuser_reserved_connections = 2
#port = 5432
#unix_socket_directory = ''
#unix_socket_group = ''
#unix_socket_permissions = 0777 # octal
#virtual_host = '' # what interface to listen on; defaults to any
#rendezvous_name = '' # defaults to the computer name
# - Security & Authentication -
#authentication_timeout = 60 # 1-600, in seconds
#ssl = false
#password_encryption = true
#krb_server_keyfile = ''
#db_user_namespace = false
#---------------------------------------------------------------------------
# RESOURCE USAGE (except WAL)
#---------------------------------------------------------------------------
# - Memory -
#shared_buffers = 1000 # min 16, at least max_connections*2, 8KB each
shared_buffers = 4096
#sort_mem = 1024 # min 64, size in KB
#vacuum_mem = 8192 # min 1024, size in KB
# - Free Space Map -
#max_fsm_pages = 20000 # min max_fsm_relations*16, 6 bytes each
#max_fsm_relations = 1000 # min 100, ~50 bytes each
# - Kernel Resource Usage -
#max_files_per_process = 1000 # min 25
#preload_libraries = ''
#---------------------------------------------------------------------------
# WRITE AHEAD LOG
#---------------------------------------------------------------------------
# - Settings -
#fsync = true # turns forced synchronization on or off
fsync = false
#wal_sync_method = fsync # the default varies across platforms:
# fsync, fdatasync, open_sync, or open_datasync
#wal_buffers = 8 # min 4, 8KB each
# - Checkpoints -
#checkpoint_segments = 3 # in logfile segments, min 1, 16MB each
#checkpoint_timeout = 300 # range 30-3600, in seconds
#checkpoint_warning = 30 # 0 is off, in seconds
#commit_delay = 0 # range 0-100000, in microseconds
#commit_siblings = 5 # range 1-1000
#---------------------------------------------------------------------------
# QUERY TUNING
#---------------------------------------------------------------------------
# - Planner Method Enabling -
#enable_hashagg = true
#enable_hashjoin = true
#enable_indexscan = true
#enable_mergejoin = true
#enable_nestloop = true
#enable_seqscan = true
#enable_sort = true
#enable_tidscan = true
# - Planner Cost Constants -
#effective_cache_size = 1000 # typically 8KB each
#random_page_cost = 4 # units are one sequential page fetch cost
#cpu_tuple_cost = 0.01 # (same)
#cpu_index_tuple_cost = 0.001 # (same)
#cpu_operator_cost = 0.0025 # (same)
# - Genetic Query Optimizer -
#geqo = true
#geqo_threshold = 11
#geqo_effort = 1
#geqo_generations = 0
#geqo_pool_size = 0 # default based on tables in statement,
# range 128-1024
#geqo_selection_bias = 2.0 # range 1.5-2.0
# - Other Planner Options -
#default_statistics_target = 10 # range 1-1000
#from_collapse_limit = 8
#join_collapse_limit = 8 # 1 disables collapsing of explicit JOINs
#---------------------------------------------------------------------------
# ERROR REPORTING AND LOGGING
#---------------------------------------------------------------------------
# - Syslog -
#syslog = 0 # range 0-2; 0=stdout; 1=both; 2=syslog
#syslog_facility = 'LOCAL0'
#syslog_ident = 'postgres'
# - When to Log -
#client_min_messages = notice # Values, in order of decreasing detail:
# debug5, debug4, debug3, debug2, debug1,
# log, info, notice, warning, error
#log_min_messages = notice # Values, in order of decreasing detail:
# debug5, debug4, debug3, debug2, debug1,
# info, notice, warning, error, log, fatal,
# panic
#log_error_verbosity = default # terse, default, or verbose messages
#log_min_error_statement = panic # Values in order of increasing severity:
# debug5, debug4, debug3, debug2, debug1,
# info, notice, warning, error, panic(off)
#log_min_duration_statement = -1 # Log all statements whose
# execution time exceeds the value, in
# milliseconds. Zero prints all queries.
# Minus-one disables.
#silent_mode = false # DO NOT USE without Syslog!
# - What to Log -
#debug_print_parse = false
#debug_print_rewritten = false
#debug_print_plan = false
#debug_pretty_print = false
#log_connections = false
#log_duration = false
#log_pid = false
#log_statement = false
log_timestamp = true
#log_hostname = false
#log_source_port = false
#---------------------------------------------------------------------------
# RUNTIME STATISTICS
#---------------------------------------------------------------------------
# - Statistics Monitoring -
#log_parser_stats = false
#log_planner_stats = false
#log_executor_stats = false
#log_statement_stats = false
# - Query/Index Statistics Collector -
#stats_start_collector = true
#stats_command_string = false
#stats_block_level = false
#stats_row_level = false
#stats_reset_on_server_start = true
#---------------------------------------------------------------------------
# CLIENT CONNECTION DEFAULTS
#---------------------------------------------------------------------------
# - Statement Behavior -
#search_path = '$user,public' # schema names
#check_function_bodies = true
#default_transaction_isolation = 'read committed'
#default_transaction_read_only = false
#statement_timeout = 0 # 0 is disabled, in milliseconds
# - Locale and Formatting -
#datestyle = 'iso, mdy'
#timezone = unknown # actually, defaults to TZ environment setting
#australian_timezones = false
#extra_float_digits = 0 # min -15, max 2
#client_encoding = sql_ascii # actually, defaults to database encoding
# These settings are initialized by initdb -- they may be changed
lc_messages = 'fr_FR.UTF-8' # locale for system error message strings
lc_monetary = 'fr_FR.UTF-8' # locale for monetary formatting
lc_numeric = 'fr_FR.UTF-8' # locale for number formatting
lc_time = 'fr_FR.UTF-8' # locale for time formatting
# - Other Defaults -
#explain_pretty_print = true
#dynamic_library_path = '$libdir'
#max_expr_depth = 10000 # min 10
#---------------------------------------------------------------------------
# LOCK MANAGEMENT
#---------------------------------------------------------------------------
#deadlock_timeout = 1000 # in milliseconds
#max_locks_per_transaction = 64 # min 10, ~260*max_connections bytes each
#---------------------------------------------------------------------------
# VERSION/PLATFORM COMPATIBILITY
#---------------------------------------------------------------------------
# - Previous Postgres Versions -
#add_missing_from = true
#regex_flavor = advanced # advanced, extended, or basic
#sql_inheritance = true
# - Other Platforms & Clients -
#transform_null_equals = false
================================================================================================================
Excerpt from the log file (I have thousands of similar lines):
Oct 5 20:51:55 sgbdmaitre kernel: Unable to handle kernel paging request at 0000001000000068 RIP:
Oct 5 20:51:55 sgbdmaitre kernel: {__vma_prio_tree_remove+63}
Oct 5 20:51:55 sgbdmaitre kernel: PML4 14649067 PGD 0
Oct 5 20:51:55 sgbdmaitre kernel: Oops: 0000 [1] SMP
Oct 5 20:51:55 sgbdmaitre kernel: CPU 0
Oct 5 20:51:55 sgbdmaitre kernel: Pid: 28595, comm: postmaster Not tainted 2.6.5-7.97-smp
Oct 5 20:51:55 sgbdmaitre kernel: RIP: 0010:[] {__vma_prio_tree_remove+63}
Oct 5 20:51:55 sgbdmaitre kernel: RSP: 0018:000001009b599ea8 EFLAGS: 00010206
Oct 5 20:51:55 sgbdmaitre kernel: RAX: 0000000000000000 RBX: 00000100afa36340 RCX: 00000100afa36340
Oct 5 20:51:55 sgbdmaitre kernel: RDX: 00000101fc3b5650 RSI: 00000100afa36340 RDI: 00000101fc3b5688
Oct 5 20:51:55 sgbdmaitre kernel: RBP: 00000100afa36290 R08: 0000001000000000 R09: 000001009b599ee8
Oct 5 20:51:55 sgbdmaitre kernel: R10: 0000002a9664bee8 R11: 0000000000000246 R12: 00000101fc3b5650
Oct 5 20:51:55 sgbdmaitre kernel: R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
Oct 5 20:51:55 sgbdmaitre kernel: FS: 0000002a96de1aa0(0000) GS:ffffffff804e7e00(0000) knlGS:00000000556ba080
Oct 5 20:51:56 sgbdmaitre kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Oct 5 20:51:56 sgbdmaitre kernel: CR2: 0000001000000068 CR3: 0000000000101000 CR4: 00000000000006e0
Oct 5 20:51:58 sgbdmaitre kernel: Process postmaster (pid: 28595, threadinfo 000001009b598000, task 000001010eb1d310)
Oct 5 20:51:58 sgbdmaitre kernel: Stack: 00000101fc3b56b8 ffffffff80170a2b 0000000000000216 00000100afa36340
Oct 5 20:51:58 sgbdmaitre kernel: 000001010eb1d310 ffffffff80170c52 0000000000000246 0000010010000140
Oct 5 20:51:58 sgbdmaitre kernel: 0000000000000653 0000010028515000
Oct 5 20:51:58 sgbdmaitre kernel: Call Trace:{remove_shared_vm_struct+75} {exit_mmap+514}
Oct 5 20:51:58 sgbdmaitre kernel: {mmput+88} {do_exit+547}
Oct 5 20:51:58 sgbdmaitre kernel: {do_group_exit+232} {system_call+124}
Oct 5 20:51:58 sgbdmaitre kernel:
Oct 5 20:51:58 sgbdmaitre kernel:
Oct 5 20:51:58 sgbdmaitre kernel: Code: 49 39 58 68 74 0c 0f 0b 77 2a 36 80 ff ff ff ff 1e 02 48 85
Oct 5 20:51:58 sgbdmaitre kernel: RIP {__vma_prio_tree_remove+63} RSP <000001009b599ea8>
Oct 5 20:51:58 sgbdmaitre kernel: CR2: 0000001000000068
Oct 5 20:51:58 sgbdmaitre kernel: <1>Unable to handle kernel NULL pointer dereference at 0000000000000028 RIP:
Oct 5 20:51:58 sgbdmaitre kernel: {mm_release+86}
Oct 5 20:51:58 sgbdmaitre kernel: PML4 14649067 PGD 0
Oct 5 20:51:58 sgbdmaitre kernel: Oops: 0000 [2] SMP
Oct 5 20:51:58 sgbdmaitre kernel: CPU 0
Oct 5 20:51:58 sgbdmaitre kernel: Pid: 28595, comm: postmaster Not tainted 2.6.5-7.97-smp
Oct 5 20:51:58 sgbdmaitre kernel: RIP: 0010:[] {mm_release+86}
Oct 5 20:51:58 sgbdmaitre kernel: RSP: 0018:000001009b599c98 EFLAGS: 00010206
Oct 5 20:51:58 sgbdmaitre kernel: RAX: 000001010eb1d310 RBX: 000001010eb1d310 RCX: ffffffff803b1df8
Oct 5 20:51:58 sgbdmaitre kernel: RDX: 000001010eb1d310 RSI: 0000000000000000 RDI: 0000002a96de1b30
Oct 5 20:51:58 sgbdmaitre kernel: RBP: 0000000000000000 R08: 0000000000000040 R09: ffffffff80522880
Oct 5 20:51:59 sgbdmaitre kernel: R10: 00000000000493e0 R11: 0000000000002710 R12: 0000000000000000
Oct 5 20:52:00 sgbdmaitre kernel: R13: 0000000000000000 R14: 0000000000000009 R15: 0000000000000000
Oct 5 20:52:00 sgbdmaitre kernel: FS: 0000002a96de1aa0(0000) GS:ffffffff804e7e00(0000) knlGS:00000000556ba080
Oct 5 20:52:01 sgbdmaitre kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Oct 5 20:52:01 sgbdmaitre kernel: CR2: 0000000000000028 CR3: 0000000000101000 CR4: 00000000000006e0
Oct 5 20:52:02 sgbdmaitre kernel: Process postmaster (pid: 28595, threadinfo 000001009b598000, task 000001010eb1d310)
Oct 5 20:52:02 sgbdmaitre kernel: Stack: ffffffff803b1df0 0000000000000000 0000001000000068 000001010eb1d310
Oct 5 20:52:02 sgbdmaitre kernel: 000001009b599df8 ffffffff8013d7e8 000001010eb1d310 ffffffffffffffef
Oct 5 20:52:02 sgbdmaitre kernel: ffffffff80111cb3 0000000000000001
Oct 5 20:52:02 sgbdmaitre kernel: Call Trace:{do_exit+328} {oops_end+35}
Oct 5 20:52:02 sgbdmaitre kernel: {do_page_fault+1200} {error_exit+0}
Oct 5 20:52:02 sgbdmaitre kernel: {__vma_prio_tree_remove+63} it+232} {system_call+124}
Oct 5 20:52:02 sgbdmaitre kernel:
Oct 5 20:52:02 sgbdmaitre kernel:
Oct 5 20:52:02 sgbdmaitre kernel: Code: 41 8b 45 28 ff c8 7e 63 48 c7 83 78 02 00 00 00 00 00 00 65
Oct 5 20:52:02 sgbdmaitre kernel: RIP {mm_release+86} RSP <000001009b5983d8>
Oct 5 20:52:02 sgbdmaitre kernel: CR2: 0000000000000028
Oct 5 20:52:02 sgbdmaitre kernel: <0>Kernel panic: Aiee, killing interrupt handler!
Oct 5 20:52:02 sgbdmaitre kernel: Unable to handle kernel paging request at fffffffea00080a2 RIP:
Oct 5 20:52:02 sgbdmaitre kernel: []
Oct 5 20:52:02 sgbdmaitre kernel: PML4 103027 PGD 0
Oct 5 20:52:02 sgbdmaitre kernel: Oops: 0010 [15] SMP
Oct 5 20:52:02 sgbdmaitre kernel: CPU 0
Oct 5 20:52:02 sgbdmaitre kernel: Pid: 28595, comm: postmaster Not tainted 2.6.5-7.97-smp
Oct 5 20:52:02 sgbdmaitre kernel: RIP: 0010:[] []
Oct 5 20:52:02 sgbdmaitre kernel: RSP: 0018:000001009b598018 EFLAGS: 00010212
Oct 5 20:52:02 sgbdmaitre kernel: RAX: 00000000a00080a2 RBX: 0000010008367c00 RCX: 00000000c0000100
Oct 5 20:52:02 sgbdmaitre kernel: RDX: 00000100fbe769a0 RSI: 000001010eb1d310 RDI: 000001017c8f4c00
Oct 5 20:52:05 sgbdmaitre kernel: RBP: 0000000000049b18 R08: 000001009b598000 R09: 00000000ffffffff
Oct 5 20:52:05 sgbdmaitre kernel: R10: 0000010097f333c0 R11: 0000000000000000 R12: 0000010008367c98
Oct 5 20:52:05 sgbdmaitre kernel: R13: 000001009b598018 R14: 0000010008367cc8 R15: 0000000000000000
Oct 5 20:52:05 sgbdmaitre kernel: FS: 0000002a96de1aa0(0000) GS:ffffffff804e7e00(0000) knlGS:00000000556ba080
Oct 5 20:52:05 sgbdmaitre kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Oct 5 20:52:05 sgbdmaitre kernel: CR2: fffffffea00080a2 CR3: 0000000000101000 CR4: 00000000000006e0
Oct 5 20:52:05 sgbdmaitre kernel: Process postmaster (pid: 28595, threadinfo 000001009b598000, task 000001010eb1d310)
Oct 5 20:52:05 sgbdmaitre kernel: Stack: 0000000000000000 000001010eb1d310 ffffffff80134fd0 0000010008367ca0
Oct 5 20:52:05 sgbdmaitre kernel: 0000010008367ca0 0000010008367c00 000001009b59808c 0000010008364c00
Oct 5 20:52:05 sgbdmaitre kernel: 0000000000000001 0000000000000001
Oct 5 20:52:05 sgbdmaitre kernel: Call Trace:{default_wake_function+0} {:ext3:ext3_sync_fs+78}
Oct 5 20:52:05 sgbdmaitre kernel: {sync_filesystems+223} {do_sync+49}
Oct 5 20:52:05 sgbdmaitre kernel: {sys_sync+62} {panic+262}
Oct 5 20:52:05 sgbdmaitre kernel: {printk+511} {do_exit+93}
Oct 5 20:52:05 sgbdmaitre kernel: {oops_end+35} {do_page_fault+1200}
Oct 5 20:52:05 sgbdmaitre kernel: {poke_blanked_console+179} {vt_console_print+727}
Oct 5 20:52:05 sgbdmaitre kernel: {__call_console_drivers+76} {error_exit+0}
Oct 5 20:52:05 sgbdmaitre kernel: {mm_release+86} {mm_release+47}
Oct 5 20:52:05 sgbdmaitre kernel: {do_exit+328} {oops_end+35}
Oct 5 20:52:05 sgbdmaitre kernel: {do_page_fault+1200} {poke_blanked_console+179}
Oct 5 20:52:05 sgbdmaitre kernel: {vt_console_print+727} {__call_console_drivers+76}
Oct 5 20:52:05 sgbdmaitre kernel: {error_exit+0} {mm_release+86}
Oct 5 20:52:05 sgbdmaitre kernel: {mm_release+47} {do_exit+328}
Oct 5 20:52:05 sgbdmaitre kernel: {oops_end+35} {do_page_fault+1200}
Oct 5 20:52:05 sgbdmaitre kernel: {poke_blanked_console+179} {vt_console_print+727}
Oct 5 20:52:05 sgbdmaitre kernel: {__call_console_drivers+76} {error_exit+0}
Oct 5 20:52:05 sgbdmaitre kernel: {mm_release+86} {mm_release+47}
Oct 5 20:52:05 sgbdmaitre kernel: {do_exit+328} {oops_end+35}
Oct 5 20:52:05 sgbdmaitre kernel: {do_page_fault+1200} {poke_blanked_console+179}
Oct 5 20:52:05 sgbdmaitre kernel: {vt_console_print+727} {__call_console_drivers+76}
Oct 5 20:52:05 sgbdmaitre kernel: {error_exit+0} {mm_release+86}
Oct 5 20:52:05 sgbdmaitre kernel: {mm_release+47} {do_exit+328}
Oct 5 20:52:05 sgbdmaitre kernel: {oops_end+35} {do_page_fault+1200}
Oct 5 20:52:05 sgbdmaitre kernel: {poke_blanked_console+179} {vt_console_print+727}
Oct 5 20:52:05 sgbdmaitre kernel: {__call_console_drivers+76} {error_exit+0}
Oct 5 20:52:05 sgbdmaitre kernel: {mm_release+86} {mm_release+47}
Oct 5 20:52:05 sgbdmaitre kernel: {do_exit+328} {oops_end+35}
Oct 5 20:52:05 sgbdmaitre kernel: {do_page_fault+1200} {poke_blanked_console+179}
Oct 5 20:52:05 sgbdmaitre kernel: {vt_console_print+727} {__call_console_drivers+76}
Oct 5 20:52:05 sgbdmaitre kernel: {error_exit+0} {mm_release+86}
Oct 5 20:52:05 sgbdmaitre kernel: {mm_release+47} {do_exit+328}
Oct 5 20:52:05 sgbdmaitre kernel: {oops_end+35} {do_page_fault+1200}
Oct 5 20:52:05 sgbdmaitre kernel: {poke_blanked_console+179} {vt_console_print+727}
Oct 5 20:52:05 sgbdmaitre kernel: {__call_console_drivers+76} {error_exit+0}
Oct 5 20:52:05 sgbdmaitre kernel: {mm_release+86} {mm_release+47}
Oct 5 20:52:05 sgbdmaitre kernel: {do_exit+328} {oops_end+35}
Oct 5 20:52:05 sgbdmaitre kernel: {do_page_fault+1200} {poke_blanked_console+179}
Oct 5 20:52:05 sgbdmaitre kernel: {vt_console_print+727} {__call_console_drivers+76}
Oct 5 20:52:05 sgbdmaitre kernel: {error_exit+0} {mm_release+86}
Oct 5 20:52:05 sgbdmaitre kernel: {mm_release+47} {do_exit+328}
Oct 5 20:52:05 sgbdmaitre kernel: {oops_end+35} {do_page_fault+1200}
Oct 5 20:52:05 sgbdmaitre kernel: {poke_blanked_console+179} {vt_console_print+727}
Oct 5 20:52:05 sgbdmaitre kernel: {__call_console_drivers+76} {error_exit+0}
Oct 5 20:52:05 sgbdmaitre kernel: {mm_release+86} {mm_release+47}
Oct 5 20:52:05 sgbdmaitre kernel: {do_exit+328} {oops_end+35}
Oct 5 20:52:05 sgbdmaitre kernel: {do_page_fault+1200} {poke_blanked_console+179}
Oct 5 20:52:05 sgbdmaitre kernel: {vt_console_print+727} {__call_console_drivers+76}
Oct 5 20:52:05 sgbdmaitre kernel: {error_exit+0} {mm_release+86}
Oct 5 20:52:05 sgbdmaitre kernel: {mm_release+47} {do_exit+328}
Oct 5 20:52:05 sgbdmaitre kernel: {oops_end+35} {do_page_fault+1200}
Oct 5 20:52:05 sgbdmaitre kernel: {poke_blanked_console+179} {vt_console_print+727}
Oct 5 20:52:05 sgbdmaitre kernel: {__call_console_drivers+76} {error_exit+0}
Oct 5 20:52:05 sgbdmaitre kernel: {mm_release+86} {mm_release+47}
Oct 5 20:52:05 sgbdmaitre kernel: {do_exit+328} {oops_end+35}
Oct 5 20:52:05 sgbdmaitre kernel: {do_page_fault+1200} {poke_blanked_console+179}
Oct 5 20:52:05 sgbdmaitre kernel: {vt_console_print+727} {__call_console_drivers+76}
Oct 5 20:52:05 sgbdmaitre kernel: {error_exit+0} {mm_release+86}
Oct 5 20:52:05 sgbdmaitre kernel: {mm_release+47} {do_exit+328}
Oct 5 20:52:05 sgbdmaitre kernel: {oops_end+35} {do_page_fault+1200}
Oct 5 20:52:05 sgbdmaitre kernel: {__wake_up_common+64} {error_exit+0}
Oct 5 20:52:05 sgbdmaitre kernel: {mm_release+86} {mm_release+47}
Oct 5 20:52:05 sgbdmaitre kernel: {do_exit+328} {oops_end+35}
Oct 5 20:52:05 sgbdmaitre kernel: {do_page_fault+1200} {error_exit+0}
Oct 5 20:52:05 sgbdmaitre kernel: {__vma_prio_tree_remove+63} {free_pages_and_swap_cache+116}
Oct 5 20:52:05 sgbdmaitre kernel: {remove_shared_vm_struct+75} {exit_mmap+514}
Oct 5 20:52:05 sgbdmaitre kernel: {mmput+88} {do_exit+547}
Oct 5 20:52:05 sgbdmaitre kernel: {do_group_exit+232} {system_call+124}
Oct 5 20:52:05 sgbdmaitre kernel:
Oct 5 20:52:05 sgbdmaitre kernel:
Oct 5 20:52:05 sgbdmaitre kernel: Code: Bad RIP value.
Oct 5 20:52:05 sgbdmaitre kernel: RIP [] RSP <000001009b598018>
Oct 5 20:52:05 sgbdmaitre kernel: CR2: fffffffea00080a2
Oct 5 20:52:05 sgbdmaitre kernel: <1>Unable to handle kernel NULL pointer dereference at 0000000000000028 RIP:
Oct 5 20:52:05 sgbdmaitre kernel: {mm_release+86}
Oct 5 20:52:05 sgbdmaitre kernel: PML4 83b9b067 PGD 307ab067 PMD 0
Oct 5 20:52:05 sgbdmaitre kernel: Oops: 0000 [16] SMP
Oct 5 20:52:05 sgbdmaitre kernel: CPU 0
Oct 5 20:52:05 sgbdmaitre kernel: Pid: 28595, comm: postmaster Not tainted 2.6.5-7.97-smp
Oct 5 20:52:05 sgbdmaitre kernel: RIP: 0010:[] {mm_release+86}
Oct 5 20:52:05 sgbdmaitre kernel: RSP: 0018:000001009b597e08 EFLAGS: 00010206
Oct 5 20:52:05 sgbdmaitre kernel: RAX: 000001010eb1d310 RBX: 000001010eb1d310 RCX: ffffffff803b1df8
Oct 5 20:52:05 sgbdmaitre kernel: RDX: 000001010eb1d310 RSI: 0000000000000000 RDI: 0000002a96de1b30
Oct 5 20:52:05 sgbdmaitre kernel: RBP: 0000000000000000 R08: 0000000000000040 R09: ffffffff80522880
Oct 5 20:52:06 sgbdmaitre kernel: R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
Oct 5 20:52:06 sgbdmaitre kernel: R13: 0000000000000000 R14: 0000000000000009 R15: 0000000000000000
Oct 5 20:52:06 sgbdmaitre kernel: FS: 0000002a96de1aa0(0000) GS:ffffffff804e7e00(0000) knlGS:00000000556ba080
Oct 5 20:52:06 sgbdmaitre kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Oct 5 20:52:06 sgbdmaitre kernel: CR2: 0000000000000028 CR3: 0000000000101000 CR4: 00000000000006e0
Oct 5 20:52:06 sgbdmaitre kernel: Process postmaster (pid: 28595, threadinfo 000001009b598000, task 000001010eb1d310)
Oct 5 20:52:06 sgbdmaitre kernel: Stack: ffffffff803b1df0 0000000000000000 fffffffea00080a2 000001010eb1d310
Oct 5 20:52:06 sgbdmaitre kernel: 000001009b597f68 ffffffff8013d7e8 ffffffff803b1da0 ffffffffffffffef
Oct 5 20:52:06 sgbdmaitre kernel: ffffffff80111cb3 0000000000000001
Oct 5 20:52:06 sgbdmaitre kernel: Call Trace:{do_exit+328} {oops_end+35}
Oct 5 20:52:06 sgbdmaitre kernel: {do_page_fault+1200} {thread_return+0}
Oct 5 20:52:06 sgbdmaitre kernel: {error_exit+0}
Oct 5 20:52:06 sgbdmaitre kernel:
Oct 5 20:52:06 sgbdmaitre kernel: Code: 41 8b 45 28 ff c8 7e 63 48 c7 83 78 02 00 00 00 00 00 00 65
Oct 5 20:52:06 sgbdmaitre kernel: RIP {mm_release+86} RSP <000001009b597e08>
Oct 5 20:52:06 sgbdmaitre kernel: CR2: 0000000000000028
Oct 5 20:52:06 sgbdmaitre kernel: <1>Unable to handle kernel NULL pointer dereference at 0000000000000028 RIP:
Oct 5 20:52:06 sgbdmaitre kernel: {mm_release+86}
Oct 5 20:52:06 sgbdmaitre kernel: PML4 83b9b067 PGD 307ab067 PMD 0
Oct 5 20:52:06 sgbdmaitre kernel: Oops: 0000 [17] SMP
Oct 5 20:52:06 sgbdmaitre kernel: CPU 0
Oct 5 20:52:06 sgbdmaitre kernel: Pid: 28595, comm: postmaster
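With thousands of similar lines, it can help to condense the log before reading it. The sketch below is one possible way to do that in Python: it counts which kernel symbol each oops faulted in and reports the highest oops sequence number seen. The regular expressions are written against the line shapes shown in this excerpt and are assumptions about the rest of the log; the function name is hypothetical.

```python
import re
from collections import Counter

# Matches e.g. "RIP: 0010:[] {mm_release+86}" and captures the symbol name.
RIP_RE = re.compile(r"RIP: [0-9a-f]{4}:\[[^\]]*\]\s*\{(\w+)\+\d+\}")
# Matches e.g. "Oops: 0000 [16] SMP" and captures the sequence number.
OOPS_RE = re.compile(r"Oops: [0-9a-f]{4} \[(\d+)\]")


def summarize_oops(lines):
    """Count faulting symbols and track the highest oops number."""
    symbols = Counter()
    last_oops = 0
    for line in lines:
        rip = RIP_RE.search(line)
        if rip:
            symbols[rip.group(1)] += 1
        oops = OOPS_RE.search(line)
        if oops:
            last_oops = max(last_oops, int(oops.group(1)))
    return symbols, last_oops


if __name__ == "__main__":
    # A few lines copied from the excerpt above.
    sample = [
        "Oct 5 20:51:55 sgbdmaitre kernel: Oops: 0000 [1] SMP",
        "Oct 5 20:51:55 sgbdmaitre kernel: RIP: 0010:[] {__vma_prio_tree_remove+63}",
        "Oct 5 20:51:58 sgbdmaitre kernel: Oops: 0000 [2] SMP",
        "Oct 5 20:51:58 sgbdmaitre kernel: RIP: 0010:[] {mm_release+86}",
        "Oct 5 20:52:02 sgbdmaitre kernel: RIP: 0010:[] []",
    ]
    counts, highest = summarize_oops(sample)
    print(dict(counts), highest)
```

Lines whose RIP has no symbol (such as the "Bad RIP value" case) are simply skipped, so the counts cover only faults the kernel could attribute to a function.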
================================================================================================================
That's it. I don't know what to do. Does Postgres have weaknesses under heavy load? Or Linux? Could I have a hardware problem?
I admit I'm somewhat at a loss.
Note: there are eleven other machines in our infrastructure, and only the one hosting PostgreSQL (and nothing else) causes problems.
Any help would be greatly appreciated.
Regards,
Jérôme