WO2003069473A1 - A method and apparatus for reconfiguring a server system - Google Patents

A method and apparatus for reconfiguring a server system Download PDF

Info

Publication number
WO2003069473A1
WO2003069473A1 PCT/SE2002/001279 SE0201279W WO03069473A1 WO 2003069473 A1 WO2003069473 A1 WO 2003069473A1 SE 0201279 W SE0201279 W SE 0201279W WO 03069473 A1 WO03069473 A1 WO 03069473A1
Authority
WO
WIPO (PCT)
Prior art keywords
server
servers
configuration
data
new
Prior art date
Application number
PCT/SE2002/001279
Other languages
French (fr)
Inventor
Stefan Willehadson
Anders Danne
Max Blommé
Original Assignee
Telefonaktiebolaget L M Ericsson (Publ)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telefonaktiebolaget L M Ericsson (Publ) filed Critical Telefonaktiebolaget L M Ericsson (Publ)
Priority to JP2003568530A priority Critical patent/JP4098723B2/en
Priority to GB0416215A priority patent/GB2400951B/en
Priority to US10/504,086 priority patent/US7739364B2/en
Priority to AU2002345485A priority patent/AU2002345485A1/en
Publication of WO2003069473A1 publication Critical patent/WO2003069473A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/505Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the load
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/461Saving or restoring of program or task context
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/465Distributed object oriented systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/466Transaction processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/5016Session

Abstract

A method and apparatus for dynamically reconfiguring a server system (200). When a new server configuration is applied, a previous server configuration is saved. When receiving service requests, a first server (20) is selected by using a first scheduling algorithm according to the new configuration. If the selected server was incorrect, a second server (204) is selected by using a second scheduling algorithm according to the saved previous configuration. User data associated with the received service request is moved from the second server, if correct, to the first server. In this way, stored data will be gradually and dynamically re-organised from the old configuration to the new configuration in a relatively simple way.

Description

A METHOD AND APPARATUS FOR RECONFIGURING A SERVER SYSTEM
TECHNICAL FIELD
The present invention relates generally to a method and apparatus for reconfiguring a server system comprising a plurality of servers for providing communication services. In particular, the server system is reconfigured dynamically such that shut-down periods for one or more servers can be avoided or at least be minimised.
BACKGROUND OF THE INVENTION AND PRIOR ART
A multitude of different fixed and mobile telecomunication/datacommunication services have been developed, in addition to traditional voice calling and short text messaging. For example, Internet browsing has rapidly become very popular, and in recent years the wireless domain has converged with the Internet . Mobile terminals are now available having functionality for connecting to the Internet over a. wireless access network to obtain information and services from sites and servers located anywhere throughout the world.
Moreover, new technologies for mobile communication are introduced, providing greater network capacity and higher transmission bit rates. In particular, GPRS (General Packet Radio Service) and CDMA (Wideband Code Division Multiple Access) networks are currently emerging for enabling wireless data services that require a wide range of different data rates. The data communicated in many new services may include voice, text, images, audio files and video files in various different formats and combinations. By way of example, mobile instant messaging, commonly known as "chatting" , and presence services are rapidly becoming popular. Instant messaging is known from the world of fixed PCs (Personal Computers) , including message status reporting and various group and contact list features . Presence services involve information on the location of mobile terminals and enable users to receive messages according to their profile and availability. A user profile can be personal and may be defined by preferences, interests and hobbies, as well as more temporary factors, such as user availability and current moods . Messages and content services can also be delivered depending on the present location, availability and terminal capabilities. It can be readily understood that such services require the storage of considerable amounts of retrievable user-specific data, which in many cases need to be frequently updated due to their dynamic nature .
The demands for telecommunication services are thus increasing rapidly, and service providers are established all over the world, equipped with hardware and software resources to meet these demands. In particular, means for processing service requests and data, as well as means for storing huge amounts of data are needed. Consequently, a service provider must be able to efficiently control the processing and storing means which typically comprise a system of different service components such as servers. The expression "server" will be used hereafter to represent any hardware and/or software for storing and/or processing data. A server may be configured to provide one or more specific services.
For an Internet service provider or the like controlling a plurality of servers, processing and storing load must be distributed over the servers . This is necessary in order to efficiently utilise available computing and storing resources, and to handle hotspots and avoid bottlenecks. As mentioned above, large amounts of data must be stored and should also be easy to find and retrieve.
As seen from the examples of services given above, different types of stored data may be of a very dynamic nature, needing frequent updating. Moreover, server systems must be reconfigured from time to time as the processing and storing load change, e.g., due to changing demands of service requests, added or removed subscribers and the introduction, modification or deletion of services. The workload on servers may increase rapidly so that individual servers are easily overloaded, at least for a short time, in particular popular web servers . To overcome overloading problems in servers; basically two solutions are available. Firstly, an existing server may be upgraded to increase its computing and/or storing capabilities. However, the server will soon become overloaded again if the amount of service requests and/or needs for data storage continue to increase. Further upgrading is then required, which can be complex and costly to perform.
Secondly, it is possible to add further servers to meet a higher load. The concept of virtual servers has been proposed to provide load sharing between plural servers . A virtual server is a scalable server built on a cluster of real servers, which is transparent to end users such that the users see only a single virtual server. The front-end of the real servers is a node, sometimes referred to as a "load balancer" , configured to schedule service requests to the different real servers. Incoming service requests may involve processing tasks and storing tasks to be performed by the servers. Scalability can thus be achieved by transparently adding or removing servers in the cluster. However, it is a problem to efficiently distribute processing and storing tasks between a plurality of servers, yet enabling easy retrieval of stored data. "Processing tasks" may involve analysing service requests, processing of data and running certain applications for delivering requested services. "Storing tasks" may involve storing new client-specific, session-specific or configuring data, updating already stored data, and retrieving stored data. For example, a service request may require the retrieval of certain data which is used as input for executing a specific processing task or service application. Client data and session data is hereafter collectively referred to as "user data" .
In current solutions involving the distribution of processing and storing tasks, a server is often allocated to a client upon a login request . The allocation scheme used for selecting a server is normally based on the current load on a predetermined set of servers, such that the server having the lowest current load, with respect to memory resources and/or CPU (Central Processing Unit) capability, etc, is selected for a client or session. Server allocation is typically performed by using a central load manager node or the like. Stored user data must then be retrievable, i.e. it must be possible to' find the server in which the searched data was stored.
The most simple current solution for selecting a server is to use a "Round Robin" allocation scheme. Further load sharing solutions are known which are more complex, such as "Weighted Round Robin" , "Least Connection" , "Weighted Least Connection" , "Locality Based Least
Connection", "Destination Hashing" and "Source Hashing". However, the solutions mentioned above are relatively complex to use, resulting in problems related to supervision, operation and maintenance, since it is difficult to predict where data will be distributed and stored. Furthermore, it may be difficult to find and retrieve data being stored in one or more servers if no reference or pointer to the data is stored as well . A client performing a login may have a proper reference to the data, but no other client or device can find and retrieve the data without the reference, unless so-called "brute force searches" are used among a set of servers .
"Round Robin" scheduling is only suitable for distributing processing load, since processing tasks are not affected by in which server they are performed. On the other hand, retrieving stored data in one of more servers cannot be done by using Round Robin but requires the use of specific pointers or references as described above. Furthermore, a common basic problem with some of the other scheduling methods mentioned above is that they use IP (Internet Protocol) addressing for scheduling. Since a plurality of clients can reside behind a single IP address (proxy, NAT, etc.), these can neither be used for data distribution nor load sharing.
Storing tasks' are therefore preferably scheduled by means of a hashing algorithm using some client or session identity as input . The hashing algorithm then provides a number identifying a selected server. If the same hashing algorithm is used again when retrieving data on a later occasion for the same client or session, the same server will be selected, provided that the servers have not been reconfigured since the searched data was stored. Thereby, the need for using specific pointers or references is eliminated.
However, if the number of servers and/or the identities of individual servers are changed in a reconfiguration operation, the hashing algorithm will most probably not provide the correct server identity anymore, if user data for a specific client or session was stored before the reconfiguration was made. A current solution for avoiding this problem is to shut down the server system from service requests for modifying the hashing algorithm and moving stored user data between servers, such that server selection becomes correct . This is a quite cumbersome and time consuming operation, which is undesirable since service requests cannot be attended meanwhile, resulting in lost revenue for the service provider. Moreover, the reconfiguration operation is relatively complex involving a considerable risk for errors .
SUMMARY OF THE INVENTION The object of the present invention is to reduce or eliminate the problems outlined above, and to provide dynamic and efficient reconfiguration of a server system during simultaneous handling of service requests . Thereby, shut-down periods for One or more servers can be avoided, or at least be minimised.
These objects and others are obtained by providing a method and apparatus for dynamically reconfiguring a server system. The server system comprises a plurality of servers being capable of performing at least one common storing task, and a scheduling unit configured to select servers for handling incoming service requests . In the inventive method and apparatus, a new server configuration is applied and a previous server configuration is saved. When a service request is received, a server is selected 'for handling the received request, by using a first scheduling algorithm according to the new configuration. If the selected server was incorrect, a server is selected in a second attempt by using a second scheduling algorithm according to the saved previous configuration. If the correct server was selected by the second scheduling algorithm, user data associated with the received service request is moved from the selected server to the server selected by the first scheduling algorithm. If the selected server was incorrect also in the second attempt, a server can be further selected by using a scheduling algorithm according to a saved even earlier configuration.
Preferably, the scheduling algorithms are hashing algorithms . Using the hashing algorithms may include deriving a hash number from a user ID and calculating a server ID number from the derived hash number. The following algorithm may the be used:
server ID = hash (user ID) /modulo n (1)
where n is the number bf possible servers, and the modulo operator providing an integer between 0 and n-1. The new configuration may involve a changed number of servers, and in the second scheduling algorithm, n is then changed accordingly from the first scheduling algorithm.
The service requests may involve storing tasks such as storing new data, updating already stored data, and retrieving stored data. Applying a new server configuration may further include storing new configuration data in a central administrator or the like, which can be accessed by the servers. Each server may comprise a local administrator or the like, being adapted to receive new configuration data from the central administrator. New configuration data can be pushed from the central administrator to the servers, or be retrieved by sending a request from a server. The central administrator may comprise means for pushing new configuration data to one or more servers. Further, the local administrator may comprise means for sending a request for new configuration data to the central administrator.
The inventive procedure may further be executed by means of a computer program comprising a software code being adapted to perform the method in a server system.
The present invention is capable of simultaneously handling a dual configuration of a server system. When the system is to be reconfigured by adding and/or removing one or more servers, the previous old configuration is saved. Thereby, requested data can be retrieved by using the old configuration, if the requested data could not be found by using the new configuration.
Stored data is gradually and dynamically reorganised from the old* configuration to the new one with a minimum of impact on the system operation. A better distribution of data may also be achieved when reconfiguring a server system, since data associated with new users or sessions can be efficiently distributed on existing servers as well as on any new added servers . BRIEF DESCRIPTION OF THE DRAWINGS
The present invention will now be described in more detail and with reference to the accompanying drawings, in which: Fig. 1 is a schematic overview of a communication scenario in which the present invention can be used. Fig. 2 is a block diagram of an exemplary server system according to one embodiment .
Fig. 3 is a flow chart illustrating a procedure for handling a service request .
Fig. 4 is a block diagram illustrating centralised administration of configuration data.
DESCRIPTION OF PREFERRED EMBODIMENTS In Fig. 1, a schematic communication scenario is illustrated in which the present invention can be used. A plurality of client terminals 100 are connected to a backbone network 102, such as the Internet. The client terminals 100 may be fixed or mobile, such as wired terminals, or wireless terminals connected to a mobile access network 104 over an air interface, as indicated in the figure. A plurality of service providers 106 are likewise connected to the backbone network 102, each comprising one or more1 servers for executing telecommunication services requested for the client terminals' 100. In reality, various further networks and/or nodes are typically involved in communication routes between clients and service providers, although not shown here for the sake of simplicity. A client terminal 100 may initiate a specific telecommunication service by sending a service request over the backbone network 102 to a service provider 106. The contacted service provider then activates one or more suitable service applications in response thereto. Activating a service application may involve various processing and storing tasks, as performed in one or more servers. A service application may also be triggered without a preceding terminal request, such as through a "push" mechanism as used for example in the context of WAP (Wireless Application Protocol) . A service request may thus originate from another service provider or network operator, e.g. needing user data of a certain client or ongoing session. For example, a service provider may want to transmit certain information to mobile stations as they enter a specific area. In that case, the service provider will request for needed user data, such as terminal capabilities and client profile, e.g., including predefined preferences and interests .
As mentioned earlier, service providers are typically equipped with a plurality of servers in order to handle service requests from clients and other service providers. Thus, the same functionality is duplicated in several servers, thereby being capable of performing the same or similar service tasks simultaneously for plural clients, be it processing or storing tasks. The present invention is concerned* with a method and apparatus for enabling efficient reconfiguration of plural servers such that any shut-down periods can be avoided or at least be minimised.
In Fig. 2, a simplified server system 200 is , illustrated for providing one or more telecommunication services for client terminals. The server system 200 comprises a scheduling unit 202 for scheduling incoming service requests involving one or more storing tasks . The scheduling unit 202 is connected to a set of servers 204 being capable of performing at least one common storing task. The servers 204 may in addition be capable of performing other tasks or operations as well, which is however not within the scope of the present invention. The servers 204 may further be designated for different service applications, and may comprise Session servers, Instant Messaging servers, Presence servers, etc.
The server system 200 may also include further components such as access nodes and different servers designated for different request handling operations . However, the functionality of these further components are generally outside the scope of the present invention, which are therefore not described here . The scheduling unit 202 may reside in an access node of a service provider, receiving service requests from clients, service providers or network operators. The scheduling unit 202 may also reside in a processing server being configured to forward certain service requests to other servers for further treatment .
The scheduling unit 202 is configured to use scheduling algorithms for .selecting specific servers 204 for incoming service requests, using some predefined client or session identity as input. A scheduling algorithm then provides a number identifying a selected server. The scheduling algorithm provides that when stored data for a client or session is retrieved, the correct server will be selected, if the number of servers 204 and their identities are the same as when the searched data'was stored. Further, if a storing task involves storing new user data, the scheduling algorithm will provide that the new data is stored in the same server as old user data was stored, thereby collecting all user data for a specific client or session in the same server.
The scheduling algorithms used are preferably hashing algorithms . A hashing algorithm means that a hash number is derived from a predetermined identity code of a client or a session, which will hereafter be collectively referred to as a "user ID" . A server identity can be determined by calculating a corresponding server ID number by means of a predetermined algorithm or formula based on the derived hash number. For example, the hash number can be a simple checksum or the like for a binary code of the user ID. According to one example, a server 204 is determined to perform a storing task from the following algorithm:
server ID = hash (user ID) modulo n (1)
where n is the number of possible servers in the set . The modulo operator will provide an integer between 0 and n-1. For example, if four secondary servers 204:0 - 204:3 are available, i.e. n = 4, and one particular client gives hash (user ID) = 14, the server ID = 2. Thus, server 204:2 is selected accordingly. For .another client, hash (user ID) = 16, leading to server ID = 0, and so forth. Any suitable hashing function may be used on a user ID, provided that the same server ID is always derived from the same user ID. In this way, storing load can be distributed in the set of servers 204.
However, if the server system needs to be reconfigured by changing the number of servers, the used hashing algorithm must also be modified in order to distribute storing load between the new number of servers. Thus, it may be necessary to add or remove one or more servers due to, e.g., trends of service demands, or changes in subscribers and offered services.
For example, if the above-described hashing algorithm (1) is used, n must be changed according to the new number of servers. Thereby, inputting a user ID in the modified hashing algorithm will most probably provide a different server ID than before .
In previous solutions, a reconfigured server system is shut down for a period of time in order to move storage of user data, or to introduce a specific pointer or reference to the relevant storage place for each affected data entry in the concerned servers . It can be readily understood that this is a tedious and time consuming procedure . Instead, the present invention provides a dynamic solution allowing simultaneous operation of handling service requests, 'thereby avoiding unwanted shutdown periods. When introducing a new server configuration, the hashing algorithm is modified according to the new configuration, and the old configuration is saved in a memory 206 in the scheduling unit 202. Saving the old configuration includes saving at least the old hashing algorithm and its parameters, and possibly also information on the server system structure of the old configuration. When receiving a service request involving a storing task for a client or session as identified by a user ID, the scheduling unit 202 makes a first attempt to find the correct server by using the modified new hashing algorithm. If the expected server is not found, i.e. the server where searched data or old user data was stored, the old hashing algorithm according to the saved old configuration is used in a second attempt. When the correct server is found and selected, for the storing task, data associated with the concerned client or session is moved from that server to a new server corresponding to the new hashing algorithm. If the new hashing algorithm happens to select the same server as the old hashing algorithm, the data will of course remain there.
In this way, already stored data can be gradually moved to correct servers in accordance with the new hashing algorithm whenever service requests are received for different clients or sessions. In the course of time, a growing share of the total data amount will end up in correct servers according to the new configuration. Finally, it may be appropriate to shut down the server system during a limited period of time for changing storage of any remaining user data which has not yet been moved. The old configuration can then be deleted from the memory 206.
It should be noted that 'it is possible to use more than two configuration versions simultaneously in a server system, within the scope of the present invention. Thus, further previous configurations of even earlier versions may be saved in the memory 206. In that case, more than two attempts can be made for finding the correct server, by using saved even earlier configuration versions. An exemplary procedure of dynamically reconfiguring a server system will now be described with reference to a flow chart shown in Fig. 3, occasionally also referring to Fig. 2. It is assumed in this example that the number of servers has been changed and a new hashing algorithm has replaced a previous one, which has been saved in a memory 206. In a first step 300, a service request is received in a scheduling unit 202. It is detected that a storing task must be executed for the service request in one of the servers 204, such as retrieving already stored user data. In a step 302, a first attempt is made to find the correct server where the searched data is stored. The new hashing algorithm is then applied to select a server for the received service request, using a corresponding user ID as input . The user ID can normally be extracted from the received service request .
Next in a step 304, it is determined whether the correct server was found and selected in step 302 by the new hashing algorithm. If so, the storing task can be executed accordingly in- a step 306, after which the procedure may return to the first step 300 for receiving a next service request. If it is determined in step 304 that the selected server was incorrect, a second attempt is made to find the correct server in a step 308, by applying a saved previous hashing algorithm on the received service request .
It is then determined in a next step 310, whether the correct server was found and selected in step 308 by the previous hashing algorithm. If so, user data associated with the service request is moved, in a step 312, from the old server to the new server selected in step 302 by the new hashing algorithm. The moved data may embrace all user data of a client or session for which the service request was directed, even if the 'storing task to be performed is concerned with only some of that data.
The storing task is also executed accordingly in a step 314, after which the procedure may return to the first step 300 for receiving a next service request. It should be noted that steps 312 and 314 can be performed in reverse order, if suitable. For example, it may be efficient to first retrieve searched data in a storing task before moving the data to another server. If it was determined in step 310 that the correct server was still not selected in step 308, it may be determined in a step 316 whether a new attempt should be made to find the correct server. If so, the procedure returns to step 308 for applying a further previous hashing algorithm, which may be an even older saved hashing algorithm, as explained above. If determined in step 316 that no new attempt is to be made, e.g. due to time-out or count-out, the storing task cannot be performed for the received service request . A new request may then be received by returning to the first step 300.
As mentioned above, reconfiguring a server system may involve changes in offered services. For example, new services may be added and present services may be modified or omitted. In a system with plural servers, such reconfiguration requires that all servers are updated with the new configuration. In previous solutions, relevant new configuration data is stored in all servers. In the present invention, all configuration data may instead be stored in a central administrator or the like, which can be accessed from the servers .
In Fig. 4, a schematic block diagram illustrates a centralised administration of configuration data. A group of servers 400 is shown, 'each being capable of handling at least one common service request, such as the servers 204 in Fig. 2. Each server 400 comprises a local administrator 402 which is connected to a central administrator 404 and having functionality for accessing configuration data there from. The central administrator 404 may be a separate node in the server system or may reside in an access node or in a server, e.g. together with a scheduling unit 202 as described above. The servers 400 may need access to certain configuration data in order to invoke requested services. However, no such configuration data is stored locally in the servers according to one embodiment . Instead, all configuration data being valid for the servers 400 is stored in the central' administrator 404. Thus, any configuration data required for invoking a requested service in a server 400 can be retrieved from the central administrator 404 by sending a request for the configuration data from its local administrator 402. Only a reference to the central administrator 404 is needed in the local administrators 402. Configuration data can also be "pushed" to one or more local administrators 402 from the central administrator 404, i.e. without a preceding request from the local administrators 402.
By this arrangement, configuration data is efficiently' stored in only one place instead of being duplicated in the servers 400. Any updating of configuration data can easily be done in the central administrator 404, which will have an accurate overview of all current services and servers . Errors and conflicts between servers having different configuration versions can thereby be avoided. Moreover, the need for synchronous updates in plural servers is eliminated. The present invention may be used to great advantage in telecommunication services defined in the context of "Wireless Village Server" , such as those relating to Instant Messaging, Presence Information and Shared Content . By using the invention in servers for handling such service requests, a dynamic and efficient reconfiguration is achieved for those servers. However, the present invention is also applicable to any scalable systems for data distribution. By using the invention, servers can thus be dynamically added and/or removed without shutting down the server system, at least for extensive time periods. In this way, a high level of service can be maintained. In particular, the present invention can advantageously be used in a server system for distributing data storage and processing load between plural servers which is described in Applicant's co-pending PCT application . In this solution, a primary server for performing a processing task is assigned using a first scheduling algorithm, which is capable of selecting any primary server. A secondary server for performing a storing task is assigned using a second scheduling algorithm, which is capable of selecting a specific secondary server corresponding to a client involved in that storing task.
While the invention has been described with reference to specific exemplary embodiments, the description is only intended to illustrate the inventive concept and should not be taken as limiting the scope of the invention. Various alternatives, modifications and equivalents may be used without departing from the spirit of the invention, which is defined by the appended claims .

Claims

1. A method of dynamically reconfiguring a server system comprising a plurality of servers for handling incoming service requests, characterised by the steps of:
A) - applying a new server configuration,
B) - saving a previous server configuration, wherein the following steps are executed when receiving service requests: C) - selecting a server for handling a received service request, by using a first scheduling algorithm according to the new configuration,
D) - selecting a server by using a second scheduling algorithm according to the previous configuration saved in step B) , if the server selected in step C) was incorrect , and
E) - moving user data associated with the received service request from the server selected in step D) , if correct, to the server selected in step C) .
2. A method according to claim 1, characterised by the further step of selecting a server by using a scheduling algorithm according to a saved even earlier configuration, if the server selected in step D) was also incorrect .
3. A method according to claim 1 or 2 , characterised in that the scheduling algorithms are hashing algorithms.
4. A method according to claim 3 , characterised in that using the hashing algorithms includes deriving a hash number from a user ID and calculating a server ID number from the derived a hash number.
5. A method according to claim 4 , characterised by using the following algorithm:
server ID = hash (user ID) modulo n (1)
where n is the number of possible servers, and the modulo operator providing an integer between 0 and n-1.
6. A method according to claim 5, characterised in that the new configuration involves a changed number of servers, and that in the second scheduling algorithm, n is changed accordingly from the first scheduling algorithm.
7. A method according to any of claims 1 - 6, characterised in that the service requests involve storing tasks.
8. A method according to claim 7, characterised in that the storing tasks include any of: storing new data, updating already stored data, and retrieving stored data.
9. A method according to any of claims 1 - 8, characterised in that step A) of applying a new server configuration includes storing new configuration data in a central administrator or the like, which can be accessed by the servers .
10.A method according to claim 9, characterised in that new configuration data is pushed from the central administrator to one or more servers.
11.A method according to claim 9, characterised in that new configuration data is retrieved from the central administrator by sending a request from a server.
12.An apparatus for reconfiguring a server system, the server system comprising a plurality of servers being capable of performing at least one common storing task, and a scheduling unit configured to select servers for handling incoming service requests, characterised by:
- means for applying a new server configuration,
- means for saving a previous server configuration,
- means for selecting a first server for handling a received service request, by using a first scheduling algorithm according to the new configuration, and
- means for selecting a second server by using a second scheduling algorithm according to the previous configuration, if the first server was incorrect, and
- means for moving user data associated with the received service request from the second server, if correct, to the first server.
13.An apparatus according to claim 12, characterised by
1 means for selecting a server by using a scheduling algorithm according to an even earlier configuration, if the second server was also incorrect .
14.An apparatus according to claim 12 or 13, characterised by means for using hashing algorithms as scheduling algorithms .
15.An apparatus according to claim 14, characterised by means for deriving a hash number from a user ID and calculating a server ID number from the derived a hash number.
16. An apparatus according to claim 15, characterised by means for using the following algorithm:
server ID = hash (user ID) modulo n (1)
where n is the number of possible servers, and the modulo operator providing an integer between 0 and n-1.
17.An apparatus according to any of claims 12 - 16, characterised by a central administrator or the like for storing new configuration data, which can be accessed by the servers .
18.An apparatus according to claim 17, characterised by a local administrator or the like in each server for receiving new configuration data from the central administrator.
19.An apparatus according to claim 17 or 18, characterised in that the central administrator comprises means for pushing new configuration data to one or more servers.
20.An apparatus according to claim 18, characterised in that the local administrator comprises means for sending a request for new configuration data to the central administrator .
21.A computer program comprising a software code being adapted to perform the method of any of claims 1 - 11 in a server system.
PCT/SE2002/001279 2002-02-13 2002-06-27 A method and apparatus for reconfiguring a server system WO2003069473A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
JP2003568530A JP4098723B2 (en) 2002-02-13 2002-06-27 Method and apparatus for reconfiguring server system
GB0416215A GB2400951B (en) 2002-02-13 2002-06-27 A method and apparatus for reconfiguring a server system
US10/504,086 US7739364B2 (en) 2002-02-13 2002-06-27 Method and apparatus for dynamically reconfiguring a server system
AU2002345485A AU2002345485A1 (en) 2002-02-13 2002-06-27 A method and apparatus for reconfiguring a server system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
SE0200417A SE0200417D0 (en) 2002-02-13 2002-02-13 A method and apparatus for reconfiguring a server system
SE0200417-4 2002-02-13

Publications (1)

Publication Number Publication Date
WO2003069473A1 true WO2003069473A1 (en) 2003-08-21

Family

ID=20286945

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/SE2002/001279 WO2003069473A1 (en) 2002-02-13 2002-06-27 A method and apparatus for reconfiguring a server system

Country Status (7)

Country Link
US (1) US7739364B2 (en)
JP (1) JP4098723B2 (en)
AU (1) AU2002345485A1 (en)
ES (1) ES2264627B2 (en)
GB (1) GB2400951B (en)
SE (1) SE0200417D0 (en)
WO (1) WO2003069473A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007272909A (en) * 2003-11-10 2007-10-18 Eath:Kk Compilation system
US8086709B2 (en) 2005-04-04 2011-12-27 Telefonaktiebolaget L M Ericsson (Publ) Method and apparatus for distributing load on application servers

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8103748B2 (en) * 2002-05-20 2012-01-24 International Business Machines Corporation Rule-based method and system for managing heterogenous computer clusters
US7636941B2 (en) 2004-03-10 2009-12-22 Microsoft Corporation Cross-domain authentication
US7934216B2 (en) * 2005-10-03 2011-04-26 International Business Machines Corporation Method and system for load balancing of computing resources
US7734296B2 (en) * 2006-05-31 2010-06-08 Cisco Technology, Inc. Management of distributed location servers
US20100125619A1 (en) * 2008-11-20 2010-05-20 Microsoft Corporation Deterministic selection of domain controllers in a multi-master database distributed directory service
JP4957765B2 (en) * 2009-08-24 2012-06-20 日本電気株式会社 Software program execution device, software program execution method, and program
JP5403446B2 (en) * 2010-03-20 2014-01-29 株式会社Pfu Virtual machine management apparatus, virtual machine management system, virtual machine management method, and program
US10069887B2 (en) * 2011-01-04 2018-09-04 Thomson Licensing Dtv Apparatus and method for transmitting live media content
FR2978848B1 (en) * 2011-08-02 2013-08-30 Viaccess Sa METHOD FOR SMOOTHING THE WORKING LOAD OF A SERVER
CN106575267B (en) * 2014-08-29 2018-04-06 三菱电机株式会社 Handle distributor and data handling system
US10778660B1 (en) * 2016-09-21 2020-09-15 Amazon Technologies, Inc. Managing multiple producer consumer—systems with non-identical idempotency keys
US11843660B2 (en) * 2018-05-28 2023-12-12 Nippon Telegraph And Telephone Corporation Transfer control device, transfer control method, service provision system, and transfer control program
CN111614726B (en) * 2020-04-24 2023-03-24 深圳震有科技股份有限公司 Data forwarding method, cluster system and storage medium

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999032956A2 (en) * 1997-12-19 1999-07-01 Holontech Corporation Dynamic load balancer for multiple network servers

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5200999A (en) * 1991-09-27 1993-04-06 International Business Machines Corporation Public key cryptosystem key management based on control vectors
US5794207A (en) * 1996-09-04 1998-08-11 Walker Asset Management Limited Partnership Method and apparatus for a cryptographically assisted commercial network system designed to facilitate buyer-driven conditional purchase offers
US5819030A (en) * 1996-07-03 1998-10-06 Microsoft Corporation System and method for configuring a server computer for optimal performance for a particular server type
US6240454B1 (en) * 1996-09-09 2001-05-29 Avaya Technology Corp. Dynamic reconfiguration of network servers
US7171662B1 (en) * 1998-03-18 2007-01-30 Microsoft Corporation System and method for software licensing
US6330710B1 (en) * 1998-06-19 2001-12-11 At&T Corp. Servlet-based architecture for dynamic service composition
US6728748B1 (en) * 1998-12-01 2004-04-27 Network Appliance, Inc. Method and apparatus for policy based class of service and adaptive service level management within the context of an internet and intranet
US6976063B1 (en) * 2000-11-02 2005-12-13 Microsoft Corporation Method and system for dynamically configuring a server computer
US20020107977A1 (en) * 2000-12-07 2002-08-08 Andrew Dunshea Multi-server system dynamic re-configuration
US6976065B2 (en) * 2001-02-23 2005-12-13 Sun Microsystems, Inc. Mechanism for reconfiguring a server without incurring server down time
US7921290B2 (en) * 2001-04-18 2011-04-05 Ipass Inc. Method and system for securely authenticating network access credentials for users
US6898705B2 (en) * 2001-05-31 2005-05-24 International Business Machines Corporation Automatic appliance server re-provision/re-purposing method
US7188163B2 (en) * 2001-11-26 2007-03-06 Sun Microsystems, Inc. Dynamic reconfiguration of applications on a server
US20030233446A1 (en) * 2002-06-12 2003-12-18 Earl William J. System and method for managing a distributed computing system
US8145759B2 (en) * 2002-11-04 2012-03-27 Oracle America, Inc. Dynamically configurable resource pool

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999032956A2 (en) * 1997-12-19 1999-07-01 Holontech Corporation Dynamic load balancer for multiple network servers

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
CHOY D.M. ET AL.: "Efficiently extendible mappings for balanced data distribution", ALGORITHMICA, vol. 16, 1996, SPRINGER-VERLAG, NEW YORK INC., pages 215 - 232, XP002958429 *
ROSS K.W.: "Hash routing for collections of shared web caches", IEEE NETWORK, November 1997 (1997-11-01) - December 1997 (1997-12-01), pages 37 - 44, XP000737464 *
VINGRALEK R. ET AL.: "SNOWBALL: Scalable storage on networks of workstations with balanced load", 1998, KLUWER ACADEMIC PUBLISHERS, MANUFACTURED IN THE NETHERLANDS, XP002958430 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007272909A (en) * 2003-11-10 2007-10-18 Eath:Kk Compilation system
JP4708383B2 (en) * 2003-11-10 2011-06-22 株式会社イース Aggregation system
US8086709B2 (en) 2005-04-04 2011-12-27 Telefonaktiebolaget L M Ericsson (Publ) Method and apparatus for distributing load on application servers

Also Published As

Publication number Publication date
AU2002345485A1 (en) 2003-09-04
SE0200417D0 (en) 2002-02-13
JP2005518016A (en) 2005-06-16
GB2400951B (en) 2005-06-22
JP4098723B2 (en) 2008-06-11
US7739364B2 (en) 2010-06-15
US20050066014A1 (en) 2005-03-24
ES2264627A1 (en) 2007-01-01
ES2264627B2 (en) 2007-08-01
GB2400951A (en) 2004-10-27
GB0416215D0 (en) 2004-08-25

Similar Documents

Publication Publication Date Title
JP7252356B2 (en) MOBILE EDGE COMPUTING NODE SELECTION METHOD, APPARATUS AND SYSTEM AND COMPUTER PROGRAM
US7844708B2 (en) Method and apparatus for load sharing and data distribution in servers
US9923958B1 (en) Highly available network filer with automatic load balancing and performance adjustment
US7739364B2 (en) Method and apparatus for dynamically reconfiguring a server system
US7373644B2 (en) Automated server replication
CN101540775B (en) Method and device for distributing contents and network system for distributing contents
KR20010088742A (en) Parallel Information Delievery Method Based on Peer-to-Peer Enabled Distributed Computing Technology
CN102067094A (en) Cache optimzation
WO2004036344A2 (en) System and method for the optimization of database
KR20080005539A (en) System, network device, method, and computer program product for active load balancing using clustered nodes as authoritative domain name servers
KR20090094292A (en) Method, device and system for distributing file data
US11159625B1 (en) Efficiently distributing connections to service instances that stream multi-tenant data
KR20100060304A (en) Distributed content delivery system based on network awareness and method thereof
CN110727738B (en) Global routing system based on data fragmentation, electronic equipment and storage medium
US20220318071A1 (en) Load balancing method and related device
US11212329B2 (en) Method, apparatus, device and storage medium for pushing video stream
US7075895B1 (en) System and method for facilitating discovery of network addresses and selected charateristics of computer systems and the like which are connected in digital data networks
CN113177179B (en) Data request connection management method, device, equipment and storage medium
CN112398802B (en) Data downloading method and related equipment
US8583819B2 (en) System and method for controlling server usage in peer-to-peer (P2P) based streaming service
JP6888478B2 (en) Sorting system
JP2007272540A (en) Data distributing method and data distributing system
CN115883657A (en) Cloud disk service accelerated scheduling method and system
CN102469110A (en) Load balancing method applied to cluster system
KR100450605B1 (en) A web application sever and method for providing dynamic contents thereof

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

ENP Entry into the national phase

Ref document number: 0416215

Country of ref document: GB

Kind code of ref document: A

Free format text: PCT FILING DATE = 20020627

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 163032

Country of ref document: IL

ENP Entry into the national phase

Ref document number: 200450045

Country of ref document: ES

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: P200450045

Country of ref document: ES

WWE Wipo information: entry into national phase

Ref document number: 2003568530

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 10504086

Country of ref document: US

122 Ep: pct application non-entry in european phase
WWP Wipo information: published in national office

Ref document number: 200450045

Country of ref document: ES

Kind code of ref document: A

WWG Wipo information: grant in national office

Ref document number: 200450045

Country of ref document: ES

Kind code of ref document: A