[OpenSIPS-Users] Failed to replicate Dialog

Liviu Chircu liviu at opensips.org
Tue Jun 9 18:41:04 CEST 2015


Hello Kneeoh,

Finally managed to replicate these errors on my own setup. In my case, 
the cause was insufficient shared memory for the _primary_ OpenSIPS 
instance, which MAY end up with some missing data within the dialog 
module structures, and unfortunately it gets replicated that way.

Recommendation:
Please make sure you always have enough shared memory ("-m" and "-M" 
command line parameters, or variables from /etc/default/opensips). For 
each 1K calls/sec with tm+dialog and 60s duration you need roughly 640MB 
of shared memory. Regarding pkg memory (-M parameter), just use "-M16" 
and you should be fine.

Best regards,

Liviu Chircu
OpenSIPS Developer
http://www.opensips-solutions.com

On 04.06.2015 22:03, Kneeoh wrote:
> I just popped up to 1.11.5 and am still getting a stream of dialog 
> replication failure even though the non-active host IS listening on 
> the same socket as the primary host. I'm banging my head on the desk, 
> I can't figure out what this isn't working.
>
> Host 2 (passive host)
> Jun  4 18:34:50  /usr/local/sbin/opensips[27448]: 
> ERROR:dialog:receive_binary_packet: Failed to process a binary packet!
> Jun  4 18:34:50  /usr/local/sbin/opensips[27445]: 
> ERROR:dialog:dlg_replicated_update: dialog not found, building new
> Jun  4 18:34:50  /usr/local/sbin/opensips[27445]: 
> ERROR:dialog:dlg_replicated_create: Dialog in DB doesn't match any 
> listening sockets
> Jun  4 18:34:50  /usr/local/sbin/opensips[27445]: 
> ERROR:dialog:receive_binary_packet: Failed to process a binary packet!
>
> Netstat on Host 1
> netstat -nlp | grep opensips
> udp        0      0 192.168.30.40:5060 
> 0.0.0.0:*                           7304/opensips <---virtual ip
> udp        0      0 192.168.30.39:5060 
> 0.0.0.0:*                           7304/opensips <---virtual ip
> udp        0      0 10.1.0.41:5092 0.0.0.0:*                           
> 7304/opensips <---binary replication binding (bin_listen)
>
> Netstat on Host 2
> netstat -nlp | grep opensips
> udp        0      0 192.168.30.40:5060 
> 0.0.0.0:*                           27441/opensips <---virtual ip
> udp        0      0 192.168.30.39:5060 
> 0.0.0.0:*                           27441/opensips <---virtual ip
> udp     2176      0 10.1.0.42:5092 0.0.0.0:*                           
> 27441/opensips <---binary replication binding (bin_listen)
>
>
>
> On Thursday, May 7, 2015 1:36 PM, Kneeoh <kneeoh at yahoo.com> wrote:
>
>
> Hi Bogdan, Both Opensips hosts are set to use corosync/heartbeat to 
> failover the two IPs in our config. Both hosts are set to 
> non-localbind and opensips is explicitly listening on both of the 
> VIPs. This is why I'm confused. It seems that everything is configured 
> correctly yet I'm getting these errors on the inactive opensips instance.
>
>
>
> On Thursday, May 7, 2015 1:05 PM, Bogdan-Andrei Iancu 
> <bogdan at opensips.org> wrote:
>
>
> Hi Kneeoh,
>
> The dialog replication is done assuming that both opensips servers do 
> share the listening interface (via vrrp, heartbeat, etc). Do you 
> different listening IPs on the 2 opensips instances ?
>
> Regards,
> Bogdan-Andrei Iancu
> OpenSIPS Founder and Developer
> http://www.opensips-solutions.com  <http://www.opensips-solutions.com/>
> On 29.04.2015 20:35, Kneeoh wrote:
>> Hello, I've got two VIPs on two instances of opensips and am doing 
>> dialog replication. I'm getting a steady stream of failed to 
>> replicate dialog errors in my opensips log.
>>
>> 192.168.30.39
>> 192.168.30.40
>> are the two VIPs. Both have a listen = on both opensips configs. I'm 
>> not sure if this line in the log is the problem but it looks like it: 
>> " DBG:core:bin_pop_str: Popped: '' [0]" I'm not sure how the receive 
>> IP could be an empty string.
>>
>> debug:
>>
>>  DBG:dialog:dlg_replicated_create: Received replicated dialog!
>>  DBG:core:bin_pop_str: Popped: 'udp:192.168.30.40:5060' [22]
>>  DBG:core:grep_sock_info: checking if host==us: 13==13 &&  
>> [192.168.30.40] == [192.168.30.39]
>>  DBG:core:grep_sock_info: checking if port 5060 matches port 5060
>>  DBG:core:grep_sock_info: checking if host==us: 13==13 &&  
>> [192.168.30.40] == [192.168.30.40]
>>  DBG:core:grep_sock_info: checking if port 5060 matches port 5060
>>  DBG:core:bin_pop_str: Popped: '' [0]
>>  ERROR:dialog:dlg_replicated_create: Dialog in DB doesn't match any 
>> listening sockets
>>  DBG:dialog:destroy_dlg: destroing dialog 0x7f09ddd9f958
>>  DBG:dialog:destroy_dlg: dlg expired or not in list - dlg 
>> 0x7f09ddd9f958 [2225:721583693] with clid 
>> 'f4f2446c-6937-1233-f798-0024e869f1eb' and tags 'NULL' 'NULL'
>>  ERROR:dialog:receive_binary_packet: Failed to process a binary packet!
>>
>>
>>
>> _______________________________________________
>> Users mailing list
>> Users at lists.opensips.org  <mailto:Users at lists.opensips.org>
>> http://lists.opensips.org/cgi-bin/mailman/listinfo/users
>
>
>
>
>
>
>
> _______________________________________________
> Users mailing list
> Users at lists.opensips.org
> http://lists.opensips.org/cgi-bin/mailman/listinfo/users

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.opensips.org/pipermail/users/attachments/20150609/8ed4b265/attachment.htm>


More information about the Users mailing list