WO2008000530A1 - A method and apparatus for caching broadcasting information - Google Patents

A method and apparatus for caching broadcasting information Download PDF

Info

Publication number
WO2008000530A1
WO2008000530A1 PCT/EP2007/053659 EP2007053659W WO2008000530A1 WO 2008000530 A1 WO2008000530 A1 WO 2008000530A1 EP 2007053659 W EP2007053659 W EP 2007053659W WO 2008000530 A1 WO2008000530 A1 WO 2008000530A1
Authority
WO
WIPO (PCT)
Prior art keywords
web
objects
probability
web object
page
Prior art date
Application number
PCT/EP2007/053659
Other languages
French (fr)
Inventor
Frederic Bauchot
Gérard Marmigere
Jean-Luc Collet
Michel Porta
Original Assignee
International Business Machines Corporation
Compagnie Ibm France
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corporation, Compagnie Ibm France filed Critical International Business Machines Corporation
Priority to JP2009517052A priority Critical patent/JP2009541877A/en
Priority to EP07728124A priority patent/EP2036307A1/en
Publication of WO2008000530A1 publication Critical patent/WO2008000530A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • G06F16/9574Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/56Provisioning of proxy services
    • H04L67/568Storing data temporarily at an intermediate stage, e.g. caching
    • H04L67/5681Pre-fetching or pre-delivering data based on network characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/56Provisioning of proxy services
    • H04L67/568Storing data temporarily at an intermediate stage, e.g. caching
    • H04L67/5682Policies or rules for updating, deleting or replacing the stored data

Definitions

  • the present invention relates to a method and apparatus for caching broadcasting information, particularly but not exclusively to information presented in the form of web information .
  • IP Internet Protocol
  • IPDC IP data-cast
  • FLUTE File Delivery Over Unidirectional Transport
  • WIFI Wireless Fidelity
  • WIFI coverage tends to be very limited and is generally only used in private areas or airports and the like.
  • FIG. 1 depicts a known mobile internet infrastructure 100 and a separated Digital Video Broadcast infrastructure 102.
  • a mobile user may access both TV channels and an internet site using a mobile device 104.
  • TV channel may be broadcast using for example DVB-H in Europe.
  • a video source 106 is broadcasted using a DVB-H head-end 108 which may include a
  • a VHF/UHF signal 116 is emitted by the DVB-H head-end and is received by the mobile device 104.
  • the mobile device may be a GSM phone or a PDA enabled to process DVB-H.
  • the mobile device 150 may support conventional Web browser capabilities and can receive signals 118 from an access point 120 belonging to a GPRS/UMTS network 122 for example.
  • the GPRS/UMTS network may access the internet 124 by means of a Telco gateway 126.
  • the internet may access a web server 128 to obtain content for transmission to the user.
  • the web server 128 may also be accessed by means of a non-mobile device, for example computer 130 which may be connected via a wired line such as PSTN, ADSL, etc.
  • Another object of the present invention is to use new broadcast transmission means to transfer both DVB-H type objects and web type objects through the same media and to schedule the web type object in an intelligent manner.
  • a further object of the invention is to provide a method for broadcasting website to mobile equipment without requiring return channel means to indicate preferences, but instead to use predictions to assist with the scheduling.
  • the present invention is directed to the method and system as defined in the independent claims. More particularly the present invention discloses a method of controlling the storage of web objects in a storage media, the web objects forming a closed web site and being for delivery to a user, the web objects being stored on the basis of predictive determinations of which web object has a highest probability of being selected from the closed web site by the user, the method comprising: identifying a plurality of web objects which form the closed web site; determining a web tree of the closed web site; identifying one or more parameters associated with the or each web object or the web tree; determining from the one or more parameters of each web object a probability of the web object being selected so as to generate a list of likely web objects; ordering the likely web objects in accordance with their probability; storing the web object from the set of likely objects in order of their relative probability such that those with the highest probability are stored first; and continuing the storing step for as long as there is available capacity in the storage media .
  • parameters of a web object such as hops or jumps from a root object, size, web tree linkage information etc.
  • This new function may be added as a service and is viewed by any conventional web browser as a web proxy.
  • the proxy intercepts the request and waits for the reception of said object. As soon as the object reception is complete without error, said object is returned to the web browser and then on to the user.
  • the proxy thus stores web objects which have the highest probability of being selected by a user based on the currently viewed object.
  • a further advantage is the extension of the functionality of this proxy by caching all pages being at a maximum predefined hops count from the current displayed page.
  • These objects may be called neighboring web objects. As the cache size is limited, web object which are not part of a group of neighboring objects may be suppressed if necessary.
  • Figure 1 is a block diagram of a prior art system for a mobile internet infrastructure and a separated Digital Video Broadcast infrastructure.
  • Figure 2 is a block diagram of a Web Cast and Digital Video Broadcast infrastructure according to the present invention .
  • Figure 3 is a block diagram showing the relationship between different Web objects according to the present invention .
  • Figure 4 is a table which shows the URL mapping of the web objects according to the present invention.
  • Figure 5 is a map diagram of a complex web site, showing the interconnections.
  • Figure 6 is a URL hops table.
  • Figure 7 shows the structure of a web object as it would be transported over the medium.
  • Figure 8 shows the structure of a web client including object cache in accordance with the present invention.
  • Figure 9A shows active web objects with 1 and 2 hops around a first reference object.
  • Figure 9B shows a new active web objects with 1 and 2 hops around a second reference object.
  • Figure 10 depicts the table managed by the predictive caching function in accordance with the present invention.
  • Figure 11 depicts the process steps for managing the cache in a predictive way in accordance with the present invention. Detailed description of the preferred embodiment
  • the Figure 2 shows a web cast and digital video broadcast infrastructure, which includes a web broadcast gateway 200 in accordance with the present invention.
  • the web broadcast gateway 200 is a link between a web server 202 and the digital video broadcast environment.
  • the web broadcast gateway 200 allows a mobile user having only broadcast TV capabilities, to access a web site and to access web objects such as music, Video, images, etc..
  • the term web objects used herein is intended to include all types of information found on the web.
  • the web broadcast gateway is used to broadcast objects forming a so-called "closed web site".
  • the term closed web site defines a web site in which links found on a web page of the site, refer to an object of this web site. In other words, no reference to an "external site" may be found in a web page of a closed web site. This will be described in greater detail below.
  • the closed web site is generated in any appropriate manner. One the closed web site is formed it is broadcast from the transmitter as will be described below.
  • the web broadcast gateway includes four main elements.
  • the first element of the web broadcast gateway is a website pump 210, which periodically explores the closed web site to identify each object belonging to this site. For each object found, a unique reference, unique object identifier (OID) is determined and a record is added or updated in a table which contains all the objects constituting the closed web site. This table is formed during an interrogation phase in which the content and the relationships between objects forming the closed web site is determined. The table may be marked as complete once the interrogation phase is finished.
  • the second element of the web broadcast gateway is a website pump 210, which periodically explores the closed web site to identify each object belonging to this site. For each object found, a unique reference, unique object identifier (OID) is determined and a record is added or updated in a table which contains all the objects constituting the closed web site. This table is formed during an interrogation phase in which the content and the relationships between objects forming the closed web site is determined. The table may be marked as complete once the interrogation phase
  • Hypertext language translator 220 which modifies the URL found in the web page (HTML, WML, etc.) to a new URL which comprises the OID and optionally meta-data relating to the web page or object. This will be described in greater detail below.
  • the third element of the web broadcast gateway is a site analyzer 230 which scans all the objects of the closed web site and computes the number of hops separating any object from all other objects included in the closed web site.
  • the fourth element of the web broadcast gateway is the scheduler 240.
  • the scheduler may be in the form of a carrousel which utilises the table built by the web site pump and modified by the site analyser 230.
  • the scheduler includes a weighting associated with each object. The weighting is based on the number of references to this object in the other closed web site pages or objects, the number of "hops" to reach this object from the root page and the size of the object. Using this weighting, the carrousel is populated.
  • the web objects which are identified and stored are delivered to the user using the FLUTE protocol as will be described in greater detail below.
  • the web objects are downloaded from the web server 128 prior to being processed and stored by the web broadcast gateway 200.
  • the mobile internet structure described in figure 1 is still maintained, i.e. the Telco gateway, GPRS/UMTS and the GSM access point. However they are not used to download web objects.
  • the output from the web broadcast gateway (from the carrousel scheduler 240) is combined with the video source output in the DVB-H encapsulation module as will be described in more detail below.
  • the DVB-H broadcast occurs in bursts with video being broadcast in some bursts and data in others.
  • the carrousel scheduler sends objects in accordance with their weightings. These weightings depend on the position of the object in the closed web site.
  • the root page may be sent many times in each circulation of the carrousel, whilst object at the lowest level of the closed web site may be sent only once or twice during each circulation. Similarly objects in between may be sent a several times during each circulation of the carrousel.
  • the mobile device 104 thus only receives transmissions from the DVB-H broadcast head-end for video and web objects in accordance with the present invention.
  • Other connections are still available for other forms of technology for exampled GPRS or UMTS via link 118. However for the purposes of this invention these other connections will not be described further .
  • FIG. 3 shows web objects with their relationships within a closed web site 300.
  • the closed web site is a site where there is no reference to an external web site having references to other sites. In other words, the number of hops starting from the root page to reach the farthest web object in the site is limited to a predefined value. This can vary depending on the storage capacity, the nature of the tree (i.e. how many pages at each hop) and the individual page sizes etc. If there are external references (i.e. outside the closed web site) these can be ignored and/or generate an appropriate message for the user to indicate that the external reference is not available.
  • the root page 310 (Index.htm) has four links.
  • Each of the pages on the second level (Page 1, Page 2 and Page 3) have links to pages at a third level which are two hops from the root page 310.
  • Page2.htm is referenced two times in root page (index.htm) 310 and in pagel.htm 350 and this can be reached from different routes.
  • the third level of pages (pages 11, 12, 21, 22, 31 and 32) is the final level in this example.
  • the tree could include more or less than three levels of pages.
  • a file (or object) VIDEOl. mpg can be accessed in a number of different locations (e.g. page 1, page 21, page 22 and page 31) .
  • AUDIOl. mp3 can be accessed on pages 2, 11 and 21. This gives flexibility to the user in accessing web objects or files and will assist in minimising delays in the user viewing specific pages. With knowledge of the web object tree and the various jumps, hops or steps between respective pages a table can be created which will be described shortly with reference to figure 4. This enables the system of the present invention to operate hypertext language translations and to optimise storage.
  • the Figure 4 shows a URL Mapping table 400 according to the present invention in which each entry represents an object of the closed web site shown in figure 3.
  • the table includes a number of columns in which the headings are identified and described below.
  • a page name 410 is shown in column 1 and is the name of the object as defined in the web server. This name may be prefixed by a hierarchy of directories which are under the Web home directory, or in any other appropriate manner.
  • An object type 430 is shown in column 2 and indicates the Web object type. Examples include "Text” for html or wml pages; "image” for Jpeg, gif, tiff or any other image or photo files; "audio” for mp3, wma, avi or any other audio or music file formats; "video” for mpeg, avi, QuickTime movie or any other video file formats.
  • OID object identifier
  • a field indicating usage 450 is also identified and populated. Although all objects are broadcast, some objects may not be usable as they are received with control access protection. To make these objects usable, the user (consumer) may request a license. Usage "on demand” is associated with these protected objects, while usage “broadcast” is used for all other objects.
  • a root hop count 460 indicates the number of pages separating the object from the root page. This information will be used to compute the object weight for the carrousel scheduling .
  • a reference count 470 is the number of references pointing to an object in all the pages in the closed web site. This information is also used in the computation of the object weight for the carrousel scheduling.
  • a size indicator 480 of object gives the size in Kbytes and is also used in the computation of the object weight for the carrousel scheduling as will be described below.
  • Figure 5 shows a complete web tree for a small closed web site 500 used as an example in this invention.
  • the web object with the OID reference 100 is the root page.
  • Neighboring pages or objects separated by one hop are indicated as 101, 102, 103 and 104 and neighboring pages separated by two hops are 105, 106, 107, 108, 109, 110, and 111. Pages with three, four, five, and six hops are also shown. Some pages can be reached via different routes which will clearly change the number of hops. Generally the shortest number of hops will be chosen as the optimum, but there may be occasions where different routes from one page to another will be utilized.
  • Figure 6 shows a URL Hops table 600 for the figure 5 web tree.
  • This table is built by the site analyzer 230, the site analyzer scans all the objects of the closed web site and computes the number of hops separating any object from all the other objects in the closed web site.
  • the columns of the table represent the target web object 0ID 610, while the rows represent the source or active web object. For example if the reference object is 0ID 100 (670), the target objects with OID' s 101, 102, 103 and 104 (640) are separated by one hop, while those with OID' s 136, 137 and 138 (650) are separated by six hops .
  • Figure 7 shows the structure of the object that is transported from the transmission end to the mobile device.
  • the web object Before storing a web object in the scheduler (carrousel) in order to broadcast it to the mobile terminal, the web object is formed into a data packet or container 700.
  • the data packet includes a number of different fields set out below.
  • a OID reference 710 is shown, which is the object identifier associated with the web object by the web site pump 210 and which is attributed to it during the translation phase.
  • the web object size 730 is indicated followed by the web object itself 740.
  • the web object may be a text page, an audio file, an image or any other type of web object.
  • a set of neighbor groups 750, 760, 770 are shown, these map the neighbor objects separated by a given number of hops from a specific or reference web object. Neighbor groupl (750) maps all objects separated by only one hop from the reference web object, while neighbor group2 (760) maps all objects separated by two hops and neighbor groupn (770) maps all object separated by "n" hops.
  • the end of the data packet is represented by the hexadecimal value x'00' .
  • Each neighbor group is structured in a specific manner and neighbor groupl 750 is shown by way of example.
  • the structure includes a neighbor group ID 751 which identifies a network group and the number of hops separating all objects of that group from the reference object. Accordingly a group for hop count equals to one will be identified. Also a group for hop count equals 2 and so on will be identified. This will continue for the maximum number of hops found in the closed web site.
  • the web objects count 752 is also found in the structure and represents the number of web objects (represented by their 0ID) belonging to this group.
  • a list of 0ID' s 753 and 754 which are neighbor at a specific hop count from the neighbor group ID is also determined.
  • the web client 800 on the mobile device includes a web browser 820 communicating with the web server through a communication stack 810.
  • the communication stack may use one of the widely used protocols such as WIFI, 2.5G and 3G protocols (GPRS, EDGE, UMTS) .
  • the web client also includes a video player 880 and a DVB-H receiver 840. Terminals supporting TV broadcast have new functions in accordance with the present invention as will be described below.
  • the Mobile TV receiver includes a number of different functions and layers, for example: a DVB-H Receiver 845; an IP module 850; a user datagram protocol (UDP) module 855 and two session layers 860 and 865 are provided for the data part FLUTE (File Delivery over Unidirectional Transport) and for the video RTP (Real Time Protocol) respectively.
  • a DVB-H Receiver 845 an IP module 850
  • UDP user datagram protocol
  • session layers 860 and 865 are provided for the data part FLUTE (File Delivery over Unidirectional Transport) and for the video RTP (Real Time Protocol) respectively.
  • the video processing elements also includes a H.264 decoder 870 and a video streamer 875.
  • the output from the video streamer is sent to the video player 880 for display to the user.
  • the present invention implements a new component which may be launched as a service which intercepts all requests from the browser and acts as a web proxy 830.
  • This new component examines all files received on FLUTE and delivers to the browser the requested file as soon as it has been received.
  • This component does not need any modification by the client applications, just a configuration of the web browser to define usage of the local web proxy.
  • the web proxy caches the page and then passes it to the web browser.
  • a main function of this client is the predictive caching mechanism which aims to save all neighboring web objects in its own memory 890.
  • the memory may be a SD card, Compact Flash, SIM card or any other appropriate memory.
  • the operation of the cache and memory will be described with reference to figures 9 and 10.
  • Figures 9A and 9B show the objects with 1 or 2 hops around a reference web object 100 for Fig 9A and web object
  • Concentric circles 900 and 910 represent the set of neighbor web objects separated by one hop for circle 900 and the set of objects separated by two hops for circle 910. Obviously there may be many other concentric circles (or layers) to represent the sets of web objects separated by higher numbers of hops (3,4,%), but in the example chosen to illustrate the invention, the number of hops has been arbitrarily fixed at two .
  • All the objects belonging to a neighbor set may be cached in the memory 890.
  • the numbers of layers which are cached will depend on the size of the memory and the sizes of the web objects and also will depend on the prediction factors discussed below.
  • the active web object (object currently displayed in the web browser) is 100.
  • This web object has four neighboring web objects separated by one hop, namely 101, 102,
  • This same active web object has seven neighboring web objects separated by two hops, namely 105, 106, 107, 108, 109, 110 and 111. If a user chooses to click on a link relating to web object 102 a new configuration is formed.
  • This new configuration has web object 102 (the now active web object) in the center and the neighboring web objects at one hop are now web objects 100 and 107 and those at two hops are now web objects 101, 103, 104, 108, 112, 113, 114 and 115.
  • By selecting the active web object to be page 102 there are four new objects 112, 113, 114 and 115 reachable in two hops. These must now be cached.
  • This cache table 1000 is used by the predictive caching function and includes the information detailed below.
  • the OID 1010 is stored as this is the object identification which is associated with the web object belonging to the closed web site by the web pump of the server and is used throughout the invention as the "identifier" of the web object.
  • the table also includes the status 1020, which depicts the status of the web object in the web cache. Examples of status may include:
  • the cache table also includes an indication of the size (in for example kilobytes) 1030 of the object. This is used to calculate the total size taken up by all the objects in cache and to decide which object should be deleted to avoid overflow of the cache.
  • a column hops 1040 represents the number of hops separating each object from the active object (active object is the html/wml page displayed in the web browser or the translation thereof) . In the illustrated example the active web object is 0ID 102.
  • a list of neighbor objects 1050 for OID neighbors with a known number of hops separating the table object from the current or active object are also shown.
  • OID 107 If “active page” (OID 107) has the status "cached", in other words this page has been received, a lookup of the cache table is done. The user can be immediately passed to the requested page or web object.
  • the table also includes information relating to neighbors to this page and other information as previously described. As OID 107 is the "active page” the neighbor set for that page must be cached as soon as possible (if it is not already) . So all neighbors of OID 107 at one hop are set to "cached" or "To be cached". In addition if space permits those at two, three or more hops may be marked similarly.
  • any neighbors of the "previous page" (OID 102) which are not neighbors of OID 107 may now be dropped out of the cache. This is due to the fact that there is now less likelihood of these pages being requested by the user as they are a greater number of hops from the "active page" currently being viewed. Accordingly any neighbors of 0ID 102 (which are not neighbors of 0ID 107) may be marked “may be suppressed” in the table. If space is required these web objects can be removed from the cache and be replaced with pages or web objects that are closer to the "active page” as described above.
  • the system must wait for the reception of this page before it can be viewed.
  • the details of the neighbor set for the "active page” is known from the table and thus the arrival and transmission of all pages are monitored.
  • a web object or page is received it is set to "cached” in the table and stored in the memory if it is the "active page” or any page having an OID in the neighbor set thereof. If the OID belongs to the set of neighbors of the "previous page” and does not belong to the neighbor set of the "active page” the web object is set to
  • the process is started at step 1100.
  • the process determines the root page of the active drive and sets it to be the "active page" in a local variable called AP. Also the local variable called PP which points to the "previous page” is set to the root page. The neighbors of the active page are set as the "active neighbors" (or local variable AN) and the local variable NP which points to a "new page” is set equal to empty or void, because at the moment there is not a "new page”. Then the AP is displayed and the cache table 1000 is updated with the AP record field status 1020 set to "cached” and the hop field 1040 set to zero .
  • step 1110 the process is waiting for an event to occur. If a page is received, then the process continues from step 1115. If the user clicks on a new page, then the process continues from step 1160. In any other case the process remains at step 1110.
  • step 1115 an analysis is carried out to determine if the received page is equal to NP. If this is the case (YES), then the process moves to step 1120; otherwise (NO) the process passes to step 1169.
  • the local variable PP is set to equal AP
  • the local variable AP is set to equal NP
  • the local variable NP is reset to equal void
  • AP is displayed and local variable AN is set to equal the set of neighbors for AP.
  • This step essentially updates the table 1000 and the page viewed by the user in accordance with receipt of a page. This page may be the root page or any other in the closed wed site.
  • the local variable CR (“current record”) is set equal to the first record of the cache table 1000.
  • the hop field 1040 of CR is updated to reflect the new number of hops with respect to the AP and then the process passes to step 1145.
  • the status field 1020 of CR is set to equal "may be suppressed" and the page may be deleted as and when memory is needed for other pages or objects.
  • step 1145 an analysis is carried out to determine if CR is the last record of the cache table 1000. If this is the case (YES) then the process passes to step 1155; otherwise (NO) the process passes to step 1150.
  • step 1150 the record sequentially after CR becomes CR and the process passes to step 1130.
  • step 1155 the cache table 1000 is updated by adding a new record for each member of AN, with a status field 1020 set equal to "to be cached".
  • step 1160 (reached from step 1110 if the user clicks on a new page) the local variable NP is set to equal the new page chosen by the user.
  • step 1165 a determination is made to determine if this NP is already cached. If this is the case (YES) the process passes to step 1120; otherwise (NO) the process passes to step 1110.
  • step 1167 an analysis is carried out to determine if the last event is the reception of a page. If this is the case
  • step 1169 the process passes to step 1169; otherwise (NO) the process passes to step 1110 once again.
  • step 1169 an analysis is carried out to determine if there is enough free space in the cache for storing the received page. If this is the case (YES) the process passes to step 1190; otherwise (NO) the process passes to step 1170.
  • step 1170 an analysis is carried out to determine if the received page is present in AN. If this is the case (YES) the process passes to step 1172; otherwise (NO) the process passes to step 1110.
  • step 1172 an analysis is carried out to determine if the cache contains any page with a status field 1020 equal to "may be suppressed". If this is the case (YES) the process passes to step 1192; otherwise (NO) the process passes to step 1175.
  • step 1175 an analysis is carried out to determine if the cache contains any page whose hop count exceeds the AP hop count by two (2) . If this is the case (YES) the process passes to step 1195; otherwise (NO) the process passes to step 1177.
  • step 1177 an analysis is carried out to determine if the cache contains any page whose hop count exceeds the AP hop count by one (1) . If this is the case (YES) the process passes to step 1185; otherwise (NO) the process passes to step 1180.
  • step 1180 the largest page found in the cache is removed. Then the process passes to step 1169.
  • step 1185 the largest page of the cache meeting the condition of step 1177 is removed. Then the process passes to step 1169.
  • the received page is recorded in the cache; the status field 1020 of the received page is set to equal "cached" and the other fields such a size 1030 and hops 1040 etc. are updated according to the characteristics of the page. Then the process passes back to step 1110.
  • step 1192 the largest page of the cache meeting the condition of step 1172 is removed. Then the process passes back to step 1169.
  • step 1195 the largest page of the cache meeting the condition of step 1175 is removed. Then once again the process passes back to step 1169.
  • the above described method steps are carried out at the receiver end of the system. This relates to the management and provision of the web object to the user.
  • the video is transmitted over the same transport stream or broadcast and received at the same end device.
  • the bursts carrying video and web objects are recognised in accordance with normal DVB-H practices. Details of the methodology for transmission and reception of the broadcast stream is described in details in our co-pending application (FR09 2006 0029), incorporated herein by reference.
  • the mobile device may be of any type, including for example, PDAs, mobile phones or any other type of device which is mobile and can communicate remotely.
  • the standards such as DVB-H, RTP, H.264 etc described herein are shown by way of example only and any other standards present or future may be equally applicable.
  • the broadcaster may have influences on the closed web site to enable them to match the broadcast media, for example, tickets for the next match if a user is watching a certain game, appropriate advertising, etc.

Abstract

A method of controlling the storage of web objects in a storage media, the web objects forming a closed web site and being for delivery to a user, the web objects being stored on the basis of predictive determinations of which web object has a highest probability of being selected from the closed web site by the user, the method comprising: identifying a plurality of web objects which form the closed web site; determining a web tree of the closed web site; identifying one or more parameters associated with the web object or the web tree; determining from the one or more parameters of each web object a probability of the web object being selected so as to generate a set of likely web objects; ordering the likely web objects in accordance with their probability; storing the web object from the set of likely objects in order of their relative probability such that those with the highest probability are stored first; and continuing the storing step for as long as there is available capacity in the storage media.

Description

A METHOD AND APPARATUS FOR CACHING BROADCASTING INFORMATION
Field of the invention
The present invention relates to a method and apparatus for caching broadcasting information, particularly but not exclusively to information presented in the form of web information .
Background Art
In the environment of mobile television (e.g. hand held television) there currently exist two complementary broadcasting techniques. Mobile TV technology offers consumers the chance to enjoy high quality terrestrial digital broadcasts along with voice telephony and Internet access all in a single wireless device. The digital video broadcast for hand-held (DVB-H) broadcasts Internet Protocol (IP) data-casts. This is an end-to-end broadcast system for the delivery of any type of digital content and services using IP based mechanisms that are optimised for devices with limitations on computational and battery resources.
An inherent part of IP data-cast (IPDC) is that it comprises a unidirectional DVD broadcast path that may be combined with a bidirectional mobile cellular interactive part. This means that IPDC is a valuable platform for enabling the convergence of services from broadcast media and telecommunications demand. The file delivery method used in the IPDC is based on the FLUTE protocol. FLUTE (File Delivery Over Unidirectional Transport) delivers contents in files, which may contain any type of data (e.g. Audio, video, binary, still images, texts? ESG carriage return, meta-data etc.). It is possible to access a web server using mobile equipment, for example using WIFI. However, WIFI coverage tends to be very limited and is generally only used in private areas or airports and the like. Although it is possible to connect to a web server using 2.5G or 3G communication means such as GPRS, EDGE or UMTS. In order to do this, it is also necessary to have a subscription which can be expensive and tends to be used by professionals and is not always attractive to public audiences of everyday people. New standards are emerging to broadcast TV on mobile equipment (e.g. DVB-H) . Some of these standards define means for object transmission. Figure 1 depicts a known mobile internet infrastructure 100 and a separated Digital Video Broadcast infrastructure 102. A mobile user may access both TV channels and an internet site using a mobile device 104. TV channel may be broadcast using for example DVB-H in Europe. A video source 106 is broadcasted using a DVB-H head-end 108 which may include a
DVB-H Encapsulator, a DVB-H Streamer or modulator 112 and an amplifier 114. A VHF/UHF signal 116 is emitted by the DVB-H head-end and is received by the mobile device 104. The mobile device may be a GSM phone or a PDA enabled to process DVB-H. In a parallel mode, the mobile device 150 may support conventional Web browser capabilities and can receive signals 118 from an access point 120 belonging to a GPRS/UMTS network 122 for example. The GPRS/UMTS network may access the internet 124 by means of a Telco gateway 126. The internet may access a web server 128 to obtain content for transmission to the user. The web server 128 may also be accessed by means of a non-mobile device, for example computer 130 which may be connected via a wired line such as PSTN, ADSL, etc.
Some of the problems that are encountered with the present systems and methods are set out as follows:
Firstly there is a requirement for expensive subscriptions and wireless equipment able to support a communication stack able to reach the internet and a separate communication stack able to receive broadcast television. A further problem exists in that coverage for internet and TV broadcast are not necessarily equivalent in certain areas, some areas may have coverage of one, some may have coverage of another, and some may have coverage of both or neither.
Our co-pending application (FR09 2006 0029) discloses a method and apparatus to transfer and receive web objects and video media through the same media channel. The content is herein combined by reference. The matter of scheduling of the web objects is discussed therein, and the present invention relates to additional inventive features thereof. One object of the present invention is to overcome at least some of the problems associated with prior art broadcasting methods and systems.
Another object of the present invention is to use new broadcast transmission means to transfer both DVB-H type objects and web type objects through the same media and to schedule the web type object in an intelligent manner.
A further object of the invention is to provide a method for broadcasting website to mobile equipment without requiring return channel means to indicate preferences, but instead to use predictions to assist with the scheduling.
Summary of the invention
The present invention is directed to the method and system as defined in the independent claims. More particularly the present invention discloses a method of controlling the storage of web objects in a storage media, the web objects forming a closed web site and being for delivery to a user, the web objects being stored on the basis of predictive determinations of which web object has a highest probability of being selected from the closed web site by the user, the method comprising: identifying a plurality of web objects which form the closed web site; determining a web tree of the closed web site; identifying one or more parameters associated with the or each web object or the web tree; determining from the one or more parameters of each web object a probability of the web object being selected so as to generate a list of likely web objects; ordering the likely web objects in accordance with their probability; storing the web object from the set of likely objects in order of their relative probability such that those with the highest probability are stored first; and continuing the storing step for as long as there is available capacity in the storage media .
Further embodiments of the invention are provided in the appended dependent claims.
This has an advantage of a scheduler which broadcasts periodically with a frequency related to the probability and predictions of what might be viewed next by a user. By updating parameters of a web object (such as hops or jumps from a root object, size, web tree linkage information etc.) subsequent web objects can be predicted and stored without a need for a return channel and the expense and complexity thereof .
Another advantage exists with respect to the function of allowing reception of web objects. This new function may be added as a service and is viewed by any conventional web browser as a web proxy. When an object is requested by the web browser, the proxy intercepts the request and waits for the reception of said object. As soon as the object reception is complete without error, said object is returned to the web browser and then on to the user. The proxy thus stores web objects which have the highest probability of being selected by a user based on the currently viewed object.
A further advantage is the extension of the functionality of this proxy by caching all pages being at a maximum predefined hops count from the current displayed page. These objects may be called neighboring web objects. As the cache size is limited, web object which are not part of a group of neighboring objects may be suppressed if necessary. Brief description of the drawings
Reference will now be made by way of example, to the accompanying drawings, in which:
Figure 1 is a block diagram of a prior art system for a mobile internet infrastructure and a separated Digital Video Broadcast infrastructure.
Figure 2 is a block diagram of a Web Cast and Digital Video Broadcast infrastructure according to the present invention . Figure 3 is a block diagram showing the relationship between different Web objects according to the present invention .
Figure 4 is a table which shows the URL mapping of the web objects according to the present invention. Figure 5 is a map diagram of a complex web site, showing the interconnections.
Figure 6 is a URL hops table.
Figure 7 shows the structure of a web object as it would be transported over the medium. Figure 8 shows the structure of a web client including object cache in accordance with the present invention.
Figure 9A shows active web objects with 1 and 2 hops around a first reference object.
Figure 9B shows a new active web objects with 1 and 2 hops around a second reference object.
Figure 10 depicts the table managed by the predictive caching function in accordance with the present invention.
Figure 11 depicts the process steps for managing the cache in a predictive way in accordance with the present invention. Detailed description of the preferred embodiment
The Figure 2 shows a web cast and digital video broadcast infrastructure, which includes a web broadcast gateway 200 in accordance with the present invention. The web broadcast gateway 200 is a link between a web server 202 and the digital video broadcast environment. The web broadcast gateway 200 allows a mobile user having only broadcast TV capabilities, to access a web site and to access web objects such as music, Video, images, etc.. The term web objects used herein is intended to include all types of information found on the web. The web broadcast gateway is used to broadcast objects forming a so-called "closed web site". The term closed web site defines a web site in which links found on a web page of the site, refer to an object of this web site. In other words, no reference to an "external site" may be found in a web page of a closed web site. This will be described in greater detail below.
The closed web site is generated in any appropriate manner. One the closed web site is formed it is broadcast from the transmitter as will be described below.
In a preferred embodiment of this invention the web broadcast gateway includes four main elements. The first element of the web broadcast gateway is a website pump 210, which periodically explores the closed web site to identify each object belonging to this site. For each object found, a unique reference, unique object identifier (OID) is determined and a record is added or updated in a table which contains all the objects constituting the closed web site. This table is formed during an interrogation phase in which the content and the relationships between objects forming the closed web site is determined. The table may be marked as complete once the interrogation phase is finished. The second element of the web broadcast gateway is a
Hypertext language translator 220 which modifies the URL found in the web page (HTML, WML, etc.) to a new URL which comprises the OID and optionally meta-data relating to the web page or object. This will be described in greater detail below.
The third element of the web broadcast gateway is a site analyzer 230 which scans all the objects of the closed web site and computes the number of hops separating any object from all other objects included in the closed web site. The fourth element of the web broadcast gateway is the scheduler 240. The scheduler may be in the form of a carrousel which utilises the table built by the web site pump and modified by the site analyser 230. The scheduler includes a weighting associated with each object. The weighting is based on the number of references to this object in the other closed web site pages or objects, the number of "hops" to reach this object from the root page and the size of the object. Using this weighting, the carrousel is populated. The web objects which are identified and stored are delivered to the user using the FLUTE protocol as will be described in greater detail below.
The web objects are downloaded from the web server 128 prior to being processed and stored by the web broadcast gateway 200. The mobile internet structure described in figure 1 is still maintained, i.e. the Telco gateway, GPRS/UMTS and the GSM access point. However they are not used to download web objects. Instead the output from the web broadcast gateway (from the carrousel scheduler 240) is combined with the video source output in the DVB-H encapsulation module as will be described in more detail below. The DVB-H broadcast occurs in bursts with video being broadcast in some bursts and data in others. The carrousel scheduler sends objects in accordance with their weightings. These weightings depend on the position of the object in the closed web site. For example the root page may be sent many times in each circulation of the carrousel, whilst object at the lowest level of the closed web site may be sent only once or twice during each circulation. Similarly objects in between may be sent a several times during each circulation of the carrousel. The mobile device 104 thus only receives transmissions from the DVB-H broadcast head-end for video and web objects in accordance with the present invention. Other connections are still available for other forms of technology for exampled GPRS or UMTS via link 118. However for the purposes of this invention these other connections will not be described further .
The "closed web site" will now be described in more detail with reference to figure 3. Figure 3 shows web objects with their relationships within a closed web site 300. The closed web site is a site where there is no reference to an external web site having references to other sites. In other words, the number of hops starting from the root page to reach the farthest web object in the site is limited to a predefined value. This can vary depending on the storage capacity, the nature of the tree (i.e. how many pages at each hop) and the individual page sizes etc. If there are external references (i.e. outside the closed web site) these can be ignored and/or generate an appropriate message for the user to indicate that the external reference is not available. In the closed web site 300 the root page 310 (Index.htm) has four links. Three links are on other web pages, link 320 on pagel.htm 350, link 330 on page2.htm and link 340 on page3.htm. The fourth link refers to an image 380 (Bkgrnd.gif) . In turn each of the pages on the second level (Page 1, Page 2 and Page 3) have links to pages at a third level which are two hops from the root page 310. Page2.htm is referenced two times in root page (index.htm) 310 and in pagel.htm 350 and this can be reached from different routes.
The third level of pages (pages 11, 12, 21, 22, 31 and 32) is the final level in this example. In other situations the tree could include more or less than three levels of pages. A file (or object) VIDEOl. mpg can be accessed in a number of different locations (e.g. page 1, page 21, page 22 and page 31) . Similarly AUDIOl. mp3 can be accessed on pages 2, 11 and 21. This gives flexibility to the user in accessing web objects or files and will assist in minimising delays in the user viewing specific pages. With knowledge of the web object tree and the various jumps, hops or steps between respective pages a table can be created which will be described shortly with reference to figure 4. This enables the system of the present invention to operate hypertext language translations and to optimise storage.
The Figure 4 shows a URL Mapping table 400 according to the present invention in which each entry represents an object of the closed web site shown in figure 3. The table includes a number of columns in which the headings are identified and described below.
A page name 410 is shown in column 1 and is the name of the object as defined in the web server. This name may be prefixed by a hierarchy of directories which are under the Web home directory, or in any other appropriate manner.
An object type 430 is shown in column 2 and indicates the Web object type. Examples include "Text" for html or wml pages; "image" for Jpeg, gif, tiff or any other image or photo files; "audio" for mp3, wma, avi or any other audio or music file formats; "video" for mpeg, avi, QuickTime movie or any other video file formats.
An object identifier (OID) 440 is shown in the next column. The OID is a unique reference number which identifies the web object in the closed web site. This OID will substitute the page name in the links found in all the pages forming the closed web site as will be described in greater detail below.
A field indicating usage 450 is also identified and populated. Although all objects are broadcast, some objects may not be usable as they are received with control access protection. To make these objects usable, the user (consumer) may request a license. Usage "on demand" is associated with these protected objects, while usage "broadcast" is used for all other objects.
A root hop count 460 indicates the number of pages separating the object from the root page. This information will be used to compute the object weight for the carrousel scheduling . A reference count 470 is the number of references pointing to an object in all the pages in the closed web site. This information is also used in the computation of the object weight for the carrousel scheduling.
A size indicator 480 of object gives the size in Kbytes and is also used in the computation of the object weight for the carrousel scheduling as will be described below.
The above-described table is used in the translator and also in the scheduler (or carrousel) as will be described below. Figure 5 shows a complete web tree for a small closed web site 500 used as an example in this invention. The web object with the OID reference 100 is the root page. Neighboring pages or objects separated by one hop are indicated as 101, 102, 103 and 104 and neighboring pages separated by two hops are 105, 106, 107, 108, 109, 110, and 111. Pages with three, four, five, and six hops are also shown. Some pages can be reached via different routes which will clearly change the number of hops. Generally the shortest number of hops will be chosen as the optimum, but there may be occasions where different routes from one page to another will be utilized.
Figure 6 shows a URL Hops table 600 for the figure 5 web tree. This table is built by the site analyzer 230, the site analyzer scans all the objects of the closed web site and computes the number of hops separating any object from all the other objects in the closed web site. The columns of the table represent the target web object 0ID 610, while the rows represent the source or active web object. For example if the reference object is 0ID 100 (670), the target objects with OID' s 101, 102, 103 and 104 (640) are separated by one hop, while those with OID' s 136, 137 and 138 (650) are separated by six hops .
Figure 7 shows the structure of the object that is transported from the transmission end to the mobile device. Before storing a web object in the scheduler (carrousel) in order to broadcast it to the mobile terminal, the web object is formed into a data packet or container 700. The data packet includes a number of different fields set out below.
A OID reference 710 is shown, which is the object identifier associated with the web object by the web site pump 210 and which is attributed to it during the translation phase. The web object size 730 is indicated followed by the web object itself 740. The web object may be a text page, an audio file, an image or any other type of web object. A set of neighbor groups 750, 760, 770 are shown, these map the neighbor objects separated by a given number of hops from a specific or reference web object. Neighbor groupl (750) maps all objects separated by only one hop from the reference web object, while neighbor group2 (760) maps all objects separated by two hops and neighbor groupn (770) maps all object separated by "n" hops. The end of the data packet is represented by the hexadecimal value x'00' .
Each neighbor group is structured in a specific manner and neighbor groupl 750 is shown by way of example. The structure includes a neighbor group ID 751 which identifies a network group and the number of hops separating all objects of that group from the reference object. Accordingly a group for hop count equals to one will be identified. Also a group for hop count equals 2 and so on will be identified. This will continue for the maximum number of hops found in the closed web site. The web objects count 752 is also found in the structure and represents the number of web objects (represented by their 0ID) belonging to this group. A list of 0ID' s 753 and 754 which are neighbor at a specific hop count from the neighbor group ID is also determined. In this way there are all relevant details of the closed web site and the tree structure in each data packet. This will enable the predictive caching of web object in accordance with user requests . The data packets are transmitted to the web client in the mobile device as will now be described with reference to figure 8. The web client 800 on the mobile device includes a web browser 820 communicating with the web server through a communication stack 810. The communication stack may use one of the widely used protocols such as WIFI, 2.5G and 3G protocols (GPRS, EDGE, UMTS) . The web client also includes a video player 880 and a DVB-H receiver 840. Terminals supporting TV broadcast have new functions in accordance with the present invention as will be described below. The Mobile TV receiver includes a number of different functions and layers, for example: a DVB-H Receiver 845; an IP module 850; a user datagram protocol (UDP) module 855 and two session layers 860 and 865 are provided for the data part FLUTE (File Delivery over Unidirectional Transport) and for the video RTP (Real Time Protocol) respectively.
The video processing elements also includes a H.264 decoder 870 and a video streamer 875. The output from the video streamer is sent to the video player 880 for display to the user. The present invention implements a new component which may be launched as a service which intercepts all requests from the browser and acts as a web proxy 830. This new component examines all files received on FLUTE and delivers to the browser the requested file as soon as it has been received. This component does not need any modification by the client applications, just a configuration of the web browser to define usage of the local web proxy. The web proxy caches the page and then passes it to the web browser. A main function of this client is the predictive caching mechanism which aims to save all neighboring web objects in its own memory 890. The memory may be a SD card, Compact Flash, SIM card or any other appropriate memory. The operation of the cache and memory will be described with reference to figures 9 and 10. Figures 9A and 9B show the objects with 1 or 2 hops around a reference web object 100 for Fig 9A and web object
102 (which become the reference web object in this example) for Fig 9B . In these figures the active web object (HTML page or translation thereof) is represented in the center. Concentric circles 900 and 910 represent the set of neighbor web objects separated by one hop for circle 900 and the set of objects separated by two hops for circle 910. Obviously there may be many other concentric circles (or layers) to represent the sets of web objects separated by higher numbers of hops (3,4,...), but in the example chosen to illustrate the invention, the number of hops has been arbitrarily fixed at two .
All the objects belonging to a neighbor set may be cached in the memory 890. The numbers of layers which are cached will depend on the size of the memory and the sizes of the web objects and also will depend on the prediction factors discussed below.
In the Fig 9A, the active web object (object currently displayed in the web browser) is 100. This web object has four neighboring web objects separated by one hop, namely 101, 102,
103 and 104. Moreover this same active web object has seven neighboring web objects separated by two hops, namely 105, 106, 107, 108, 109, 110 and 111. If a user chooses to click on a link relating to web object 102 a new configuration is formed. This new configuration has web object 102 ( the now active web object) in the center and the neighboring web objects at one hop are now web objects 100 and 107 and those at two hops are now web objects 101, 103, 104, 108, 112, 113, 114 and 115. By selecting the active web object to be page 102 there are four new objects 112, 113, 114 and 115 reachable in two hops. These must now be cached. At the same time five old objects 105, 106, 109, 110 and 111 are no more in the neighbor scope and thus may be removed from the cache if necessary to free up memory. The information relating to the objects and how they are cached may be stored in a table such as that illustrated in figure 10. This cache table 1000 is used by the predictive caching function and includes the information detailed below.
The OID 1010 is stored as this is the object identification which is associated with the web object belonging to the closed web site by the web pump of the server and is used throughout the invention as the "identifier" of the web object. The table also includes the status 1020, which depicts the status of the web object in the web cache. Examples of status may include:
- "Cached": this indicates that the web object has been saved in the cache, as is the case with web object 104 (1060) .
- "May be suppressed": this indicates that the web object is present in the cache but no longer belongs to the group of neighbor web objects, as is the case with web object 106 (1070) .
"To be cached": this indicates that the web object belongs to the scope of neighbor web objects but has not yet been received, as is the case with web object 111 (1080) . The cache table also includes an indication of the size (in for example kilobytes) 1030 of the object. This is used to calculate the total size taken up by all the objects in cache and to decide which object should be deleted to avoid overflow of the cache. A column hops 1040 represents the number of hops separating each object from the active object (active object is the html/wml page displayed in the web browser or the translation thereof) . In the illustrated example the active web object is 0ID 102. A list of neighbor objects 1050 for OID neighbors with a known number of hops separating the table object from the current or active object are also shown. The list of all objects separated from the current OID by the number maximum of hops configured in the client. In the illustrated example the maximum number of hops is 2. If it is assumed that a user is viewing a current active page (say OID 102) and clicks on a link to display a new page (say OID 107) a local variable "active page" is set in the table for OID 107. The current active page (OID 102) is then set to the local variable "previous page" in the line of the table associated with OID 102.
If "active page" (OID 107) has the status "cached", in other words this page has been received, a lookup of the cache table is done. The user can be immediately passed to the requested page or web object. The table also includes information relating to neighbors to this page and other information as previously described. As OID 107 is the "active page" the neighbor set for that page must be cached as soon as possible (if it is not already) . So all neighbors of OID 107 at one hop are set to "cached" or "To be cached". In addition if space permits those at two, three or more hops may be marked similarly. If there is insufficient space to cache all web objects or pages at two, three or more hops a selection may be made based first on lowest hop number then on size or another parameter to determine the pages to cache and those to be cached when space permits. In addition, any neighbors of the "previous page" (OID 102) which are not neighbors of OID 107 may now be dropped out of the cache. This is due to the fact that there is now less likelihood of these pages being requested by the user as they are a greater number of hops from the "active page" currently being viewed. Accordingly any neighbors of 0ID 102 (which are not neighbors of 0ID 107) may be marked "may be suppressed" in the table. If space is required these web objects can be removed from the cache and be replaced with pages or web objects that are closer to the "active page" as described above.
If the "active page" 0ID 107 has a status "to be cached", the system must wait for the reception of this page before it can be viewed. The details of the neighbor set for the "active page" is known from the table and thus the arrival and transmission of all pages are monitored. When a web object or page is received it is set to "cached" in the table and stored in the memory if it is the "active page" or any page having an OID in the neighbor set thereof. If the OID belongs to the set of neighbors of the "previous page" and does not belong to the neighbor set of the "active page" the web object is set to
"may be suppressed". Any pages of the "active page" which are not already on route and captured as above will be marked "to be cached" and a request for delivery will be made.
These process steps for this are described in greater detail with respect to figure 11.
The process is started at step 1100.
At step 1105, the process determines the root page of the active drive and sets it to be the "active page" in a local variable called AP. Also the local variable called PP which points to the "previous page" is set to the root page. The neighbors of the active page are set as the "active neighbors" (or local variable AN) and the local variable NP which points to a "new page" is set equal to empty or void, because at the moment there is not a "new page". Then the AP is displayed and the cache table 1000 is updated with the AP record field status 1020 set to "cached" and the hop field 1040 set to zero .
At step 1110, the process is waiting for an event to occur. If a page is received, then the process continues from step 1115. If the user clicks on a new page, then the process continues from step 1160. In any other case the process remains at step 1110.
At step 1115, an analysis is carried out to determine if the received page is equal to NP. If this is the case (YES), then the process moves to step 1120; otherwise (NO) the process passes to step 1169.
At step 1120, the local variable PP is set to equal AP, the local variable AP is set to equal NP, the local variable NP is reset to equal void. AP is displayed and local variable AN is set to equal the set of neighbors for AP. This step essentially updates the table 1000 and the page viewed by the user in accordance with receipt of a page. This page may be the root page or any other in the closed wed site.
At step 1125, the local variable CR ("current record") is set equal to the first record of the cache table 1000.
At step 1130, a determination is made as to whether CR. OID belongs to AN. If it is the case (YES), then the process passes to step 1135; otherwise (NO) the process passes to step 1140. At step 1135, the hop field 1040 of CR is updated to reflect the new number of hops with respect to the AP and then the process passes to step 1145.
At step 1140, the status field 1020 of CR is set to equal "may be suppressed" and the page may be deleted as and when memory is needed for other pages or objects.
At step 1145, an analysis is carried out to determine if CR is the last record of the cache table 1000. If this is the case (YES) then the process passes to step 1155; otherwise (NO) the process passes to step 1150. At step 1150, the record sequentially after CR becomes CR and the process passes to step 1130. At step 1155, the cache table 1000 is updated by adding a new record for each member of AN, with a status field 1020 set equal to "to be cached".
At step 1160 (reached from step 1110 if the user clicks on a new page) the local variable NP is set to equal the new page chosen by the user.
At step 1165, a determination is made to determine if this NP is already cached. If this is the case (YES) the process passes to step 1120; otherwise (NO) the process passes to step 1110.
At step 1167, an analysis is carried out to determine if the last event is the reception of a page. If this is the case
(YES) the process passes to step 1169; otherwise (NO) the process passes to step 1110 once again. At step 1169, an analysis is carried out to determine if there is enough free space in the cache for storing the received page. If this is the case (YES) the process passes to step 1190; otherwise (NO) the process passes to step 1170.
At step 1170, an analysis is carried out to determine if the received page is present in AN. If this is the case (YES) the process passes to step 1172; otherwise (NO) the process passes to step 1110.
At step 1172, an analysis is carried out to determine if the cache contains any page with a status field 1020 equal to "may be suppressed". If this is the case (YES) the process passes to step 1192; otherwise (NO) the process passes to step 1175.
At step 1175, an analysis is carried out to determine if the cache contains any page whose hop count exceeds the AP hop count by two (2) . If this is the case (YES) the process passes to step 1195; otherwise (NO) the process passes to step 1177.
At step 1177 , an analysis is carried out to determine if the cache contains any page whose hop count exceeds the AP hop count by one (1) . If this is the case (YES) the process passes to step 1185; otherwise (NO) the process passes to step 1180.
At step 1180, the largest page found in the cache is removed. Then the process passes to step 1169. At step 1185, the largest page of the cache meeting the condition of step 1177 is removed. Then the process passes to step 1169.
At step 1190, the received page is recorded in the cache; the status field 1020 of the received page is set to equal "cached" and the other fields such a size 1030 and hops 1040 etc. are updated according to the characteristics of the page. Then the process passes back to step 1110.
At step 1192, the largest page of the cache meeting the condition of step 1172 is removed. Then the process passes back to step 1169.
At step 1195, the largest page of the cache meeting the condition of step 1175 is removed. Then once again the process passes back to step 1169.
The above described method steps are carried out at the receiver end of the system. This relates to the management and provision of the web object to the user. In parallel the video is transmitted over the same transport stream or broadcast and received at the same end device. The bursts carrying video and web objects are recognised in accordance with normal DVB-H practices. Details of the methodology for transmission and reception of the broadcast stream is described in details in our co-pending application (FR09 2006 0029), incorporated herein by reference.
The mobile device may be of any type, including for example, PDAs, mobile phones or any other type of device which is mobile and can communicate remotely. The standards such as DVB-H, RTP, H.264 etc described herein are shown by way of example only and any other standards present or future may be equally applicable.
The broadcaster may have influences on the closed web site to enable them to match the broadcast media, for example, tickets for the next match if a user is watching a certain game, appropriate advertising, etc.
While the invention has been particularly shown and described with reference to a preferred embodiment, it will be understood that various changes in form and detail may be made therein without departing from the spirit, and scope of the invention .

Claims

1. A method of controlling the storage of web objects in a storage media, the web objects forming a closed web site and being for delivery to a user, the web objects being stored on the basis of predictive determinations of which web object has a highest probability of being selected from the closed web site by the user, the method comprising:
- identifying a plurality of web objects which form the closed web site; - determining a web tree of the closed web site;
- identifying one or more parameters associated with the or each web object or the web tree;
- determining from the one or more parameters of each web object a probability of the web object being selected so as to generate a list of likely web objects;
- ordering the likely web objects in accordance with their probability;
- storing the web object from the set of likely objects in order of their relative probability such that those with the highest probability are stored first; and
- continuing the storing step for as long as there is available capacity in the storage media.
2. The method of claim 1, wherein the step of determining the probability of each web object being selected comprises determining a numbers of steps from a root web object to each web object in the closed web site, such that those with a lower number of steps have a higher probability of being selected than those with a higher number of steps.
3. The method of claim 1 or claim 2, wherein the step of determining the probability of each web object being selected comprises determining the size of each web object such that those of a smaller size have a higher probability than those with a bigger size.
4. The method of any one of claims 1 to 3, wherein the step of determining the probability of each web object being selected comprises determining the number of neighboring web objects for the or each web object .
5. The method of any preceding claim, wherein the step of determining the probability of each web object being selected comprises determining the number of links to other web objects in the or each web object.
6. The method of any preceding claim, further comprising updating the determination of parameters if the user selects a web object, such that a new set of parameters are determined and used to determine the probability of the subsequent selection of the user.
7. The method of claim 6, further comprising removing web objects from the storage media based on the probabilities for the subsequent selection, such that the web object with the lowest priority of being subsequently selected is removed first.
8. The method of claim 7, further comprising removing the web objects from the storage media until there is sufficient space in the storage medium for new high priority web objects.
9. The method of claim 6 or claim 7, further comprising repeating the updating step each time the user makes a selection .
10. The method of any preceding claim, further comprising forming a table of the parameters for each web object so that the probability can be determined from the table of parameters .
11. A method of viewing a web object at a remote mobile device (104) having a memory (890) which web object is transmitted over a broadcast standard channel in accordance with the method according to any preceding claim, the method of viewing comprising the steps of: - receiving a plurality of web objects forming some or all of a closed web site, wherein the web object includes one or more parameters which determine a priority of a user selecting that object;
- storing the web objects in the memory in accordance with the priority, such that those with the highest probability are stored first;
- selecting a web object to view, thereby causing an update to the one or more parameters and a new probability for the or each web object being subsequently selected; - updating the web objects stored in memory based on the new priority such that those with the highest new probability are stored first;
- repeating the selecting and updating steps as required.
12. A system comprising means adapted for carrying out the steps of the method according to any one of claims 1 to 10.
13. A device comprising means adapted for carrying out the steps of the method according to any one claims 1 to 11.
14. A computer program comprising instructions for carrying out the steps of the method according to any one of claims 1 to 10, when said computer program is executed on a computer system.
15. A computer program comprising instructions for carrying out the steps of the method according to claim 11, when said computer program is executed on a computer system.
PCT/EP2007/053659 2006-06-30 2007-04-16 A method and apparatus for caching broadcasting information WO2008000530A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2009517052A JP2009541877A (en) 2006-06-30 2007-04-16 Method, system, apparatus and computer program for controlling web objects (method and apparatus for caching broadcast information)
EP07728124A EP2036307A1 (en) 2006-06-30 2007-04-16 A method and apparatus for caching broadcasting information

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP06116433 2006-06-30
EP06116433.1 2006-06-30

Publications (1)

Publication Number Publication Date
WO2008000530A1 true WO2008000530A1 (en) 2008-01-03

Family

ID=38179594

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2007/053659 WO2008000530A1 (en) 2006-06-30 2007-04-16 A method and apparatus for caching broadcasting information

Country Status (5)

Country Link
US (1) US8407260B2 (en)
EP (1) EP2036307A1 (en)
JP (1) JP2009541877A (en)
CN (1) CN101455057A (en)
WO (1) WO2008000530A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10195890B2 (en) 2014-09-16 2019-02-05 Crane Security Technologies, Inc. Secure lens layer

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009541877A (en) 2006-06-30 2009-11-26 インターナショナル・ビジネス・マシーンズ・コーポレーション Method, system, apparatus and computer program for controlling web objects (method and apparatus for caching broadcast information)
US7873703B2 (en) * 2006-06-30 2011-01-18 International Business Machines Corporation Method and apparatus for broadcasting information
US20090112975A1 (en) * 2007-10-31 2009-04-30 Microsoft Corporation Pre-fetching in distributed computing environments
US8185910B2 (en) 2008-08-27 2012-05-22 Eric Sven-Johan Swildens Method and system for testing interactions between web clients and networked servers
JP5321221B2 (en) * 2009-04-22 2013-10-23 富士通株式会社 Management device, management method, and program
US9253548B2 (en) * 2010-05-27 2016-02-02 Adobe Systems Incorporated Optimizing caches for media streaming
US8265778B2 (en) 2010-06-17 2012-09-11 Microsoft Corporation Event prediction using hierarchical event features
US20130198351A1 (en) * 2012-01-27 2013-08-01 Alcatel-Lucent Usa Inc Flexible Caching in a Content Centric Network
KR101978177B1 (en) * 2012-12-07 2019-08-28 삼성전자주식회사 Method of caching contents by node and method of transmitting contents by contents provider in a content centric network
CN104954396B (en) * 2014-03-25 2018-09-07 华为技术有限公司 The method of resource downloading and the device of resource downloading
CN104580435B (en) * 2014-12-27 2019-03-08 北京奇虎科技有限公司 A kind of caching method and device of network connection

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6098064A (en) * 1998-05-22 2000-08-01 Xerox Corporation Prefetching and caching documents according to probability ranked need S list
WO2002073460A2 (en) * 2001-03-08 2002-09-19 International Business Machines Corporation Predictive caching and highlighting of web pages
US20050216554A1 (en) * 2004-02-19 2005-09-29 Yehuda Meiman Use of cache memory for decreasing the number of requests forwarded to server computers

Family Cites Families (53)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3488289B2 (en) * 1994-09-19 2004-01-19 Kddi株式会社 Hypermedia document communication device
US6460036B1 (en) * 1994-11-29 2002-10-01 Pinpoint Incorporated System and method for providing customized electronic newspapers and target advertisements
US20020049832A1 (en) * 1996-03-08 2002-04-25 Craig Ullman Enhanced video programming system and method for incorporating and displaying retrieved integrated internet information segments
US6018764A (en) 1996-12-10 2000-01-25 General Instrument Corporation Mapping uniform resource locators to broadcast addresses in a television signal
US5933849A (en) * 1997-04-10 1999-08-03 At&T Corp Scalable distributed caching system and method
US6266742B1 (en) * 1997-10-27 2001-07-24 International Business Machines Corporation Algorithm for cache replacement
US6128701A (en) * 1997-10-28 2000-10-03 Cache Flow, Inc. Adaptive and predictive cache refresh policy
US6377981B1 (en) 1997-11-20 2002-04-23 Cyberstar, L.P. Modular digital data communication cyberstation and cyberserver
US6959318B1 (en) * 1998-03-06 2005-10-25 Intel Corporation Method of proxy-assisted predictive pre-fetching with transcoding
US6459427B1 (en) * 1998-04-01 2002-10-01 Liberate Technologies Apparatus and method for web-casting over digital broadcast TV network
US6385699B1 (en) * 1998-04-10 2002-05-07 International Business Machines Corporation Managing an object store based on object replacement penalties and reference probabilities
US6128627A (en) * 1998-04-15 2000-10-03 Inktomi Corporation Consistent data storage in an object cache
US6338117B1 (en) * 1998-08-28 2002-01-08 International Business Machines Corporation System and method for coordinated hierarchical caching and cache replacement
EP0984367B1 (en) * 1998-09-03 2003-05-14 Ricoh Company, Ltd. Recording media with video, respectively audio index information, information management and retrieval methods for video, respectively audio information and a video retrieval system
JP2000099438A (en) * 1998-09-18 2000-04-07 Toshiba Corp Information processor and information look-ahead method
US6317778B1 (en) * 1998-11-23 2001-11-13 International Business Machines Corporation System and method for replacement and duplication of objects in a cache
US6470436B1 (en) * 1998-12-01 2002-10-22 Fast-Chip, Inc. Eliminating memory fragmentation and garbage collection from the process of managing dynamically allocated memory
US7406098B2 (en) 1999-01-13 2008-07-29 Qualcomm Incorporated Resource allocation in a communication system supporting application flows having quality of service requirements
US6542967B1 (en) * 1999-04-12 2003-04-01 Novell, Inc. Cache object store
CN1700767A (en) 1999-04-15 2005-11-23 思凯思跃姆网络有限公司 Data broadcast system
JP2000311108A (en) * 1999-04-27 2000-11-07 Nec Corp System and method for loading home page
US20010047394A1 (en) * 1999-09-10 2001-11-29 Kloba David D. System, method, and computer program product for executing scripts on mobile devices
US6775231B1 (en) 1999-09-24 2004-08-10 Cisco Technology Dynamic weighted resource sharing
US6330395B1 (en) 1999-12-29 2001-12-11 Chia-Hsiung Wu Heating apparatus with safety sealing
JP4514872B2 (en) * 2000-01-26 2010-07-28 シャープ株式会社 Information acquisition apparatus, information acquisition method, and computer-readable recording medium on which information acquisition program is recorded
US6820133B1 (en) * 2000-02-07 2004-11-16 Netli, Inc. System and method for high-performance delivery of web content using high-performance communications protocol between the first and second specialized intermediate nodes to optimize a measure of communications performance between the source and the destination
US7565450B2 (en) * 2000-03-16 2009-07-21 Adara Networks Inc. System and method for using a mapping between client addresses and addresses of caches to support content delivery
US7552233B2 (en) * 2000-03-16 2009-06-23 Adara Networks, Inc. System and method for information object routing in computer networks
US7398312B1 (en) * 2000-03-29 2008-07-08 Lucent Technologies Inc. Method and system for caching streaming multimedia on the internet
FR2808909B1 (en) * 2000-05-11 2005-06-03 Jean Marie Billiotte METHOD FOR CENTRALIZED STOCHASTIC SIMULATION AND TELETRANSMISSION OF PROBABLE SCENARIOS FOR THE PROBABILISTIC OPTIMIZATION OF PARAMETERS OF REMOTE INDUSTRIAL SYSTEMS
US20040054572A1 (en) * 2000-07-27 2004-03-18 Alison Oldale Collaborative filtering
US20020069296A1 (en) 2000-12-06 2002-06-06 Bernie Aua Internet content reformatting apparatus and method
US7536413B1 (en) * 2001-05-07 2009-05-19 Ixreveal, Inc. Concept-based categorization of unstructured objects
US6754799B2 (en) * 2001-05-16 2004-06-22 Microsoft Corporation System and method for indexing and retrieving cached objects
US20030030666A1 (en) * 2001-08-07 2003-02-13 Amir Najmi Intelligent adaptive navigation optimization
CA2440236C (en) 2001-12-28 2008-02-12 Nokia Corporation Packet scheduling method and apparatus
EP1324553A3 (en) 2001-12-31 2006-03-22 Alcatel Canada Inc. Method and apparatus for scheduling and servicing events using a calendar structure
JP4535661B2 (en) 2002-03-18 2010-09-01 日本電気株式会社 Transmission node, relay node and communication system in wireless multi-hop network
US7328192B1 (en) * 2002-05-10 2008-02-05 Oracle International Corporation Asynchronous data mining system for database management system
CN1221898C (en) * 2002-08-13 2005-10-05 国际商业机器公司 System and method for updating network proxy cache server object
US7065780B2 (en) 2002-09-20 2006-06-20 Opentv, Inc. Method and system for emulating and HTTP server through a broadcast carousel
US6970975B2 (en) * 2002-11-15 2005-11-29 Exanet Co. Method for efficient caching and enumerating objects in distributed storage systems
US20040157193A1 (en) * 2003-02-10 2004-08-12 Mejias Ulises Ali Computer-aided design and production of an online learning course
JP2004260573A (en) 2003-02-26 2004-09-16 Recruit Co Ltd System and method for providing contents
JP2004280405A (en) * 2003-03-14 2004-10-07 Sony Corp System and method for providing information, and computer program
US20050005027A1 (en) * 2003-04-18 2005-01-06 International Business Machines Corporation Method and system for obtaining data through an IP transmission network by using an optimized domain name server
US7076611B2 (en) * 2003-08-01 2006-07-11 Microsoft Corporation System and method for managing objects stored in a cache
US20050278540A1 (en) * 2004-05-26 2005-12-15 Opinity, Inc. System, method, and computer program product for validating an identity claimed by a subject
US7571167B1 (en) * 2004-06-15 2009-08-04 David Anthony Campana Peer-to-peer network content object information caching
US7769974B2 (en) * 2004-09-10 2010-08-03 Microsoft Corporation Increasing data locality of recently accessed resources
WO2006124287A2 (en) * 2005-05-02 2006-11-23 Brown University Importance ranking for a hierarchical collection of objects
US7693804B2 (en) * 2005-11-28 2010-04-06 Fatlens Inc. Method, system and computer program product for identifying primary product objects
JP2009541877A (en) 2006-06-30 2009-11-26 インターナショナル・ビジネス・マシーンズ・コーポレーション Method, system, apparatus and computer program for controlling web objects (method and apparatus for caching broadcast information)

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6098064A (en) * 1998-05-22 2000-08-01 Xerox Corporation Prefetching and caching documents according to probability ranked need S list
WO2002073460A2 (en) * 2001-03-08 2002-09-19 International Business Machines Corporation Predictive caching and highlighting of web pages
US20050216554A1 (en) * 2004-02-19 2005-09-29 Yehuda Meiman Use of cache memory for decreasing the number of requests forwarded to server computers

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10195890B2 (en) 2014-09-16 2019-02-05 Crane Security Technologies, Inc. Secure lens layer

Also Published As

Publication number Publication date
JP2009541877A (en) 2009-11-26
EP2036307A1 (en) 2009-03-18
US8407260B2 (en) 2013-03-26
US20080005163A1 (en) 2008-01-03
CN101455057A (en) 2009-06-10

Similar Documents

Publication Publication Date Title
US8407260B2 (en) Method and apparatus for caching broadcasting information
US7721337B2 (en) System and method for providing a push of background data
EP1550310B2 (en) Method and system for emulating an http server through a broadcast carousel
CN108063769B (en) Method and device for realizing content service and content distribution network node
US20040002896A1 (en) Collection of behavior data on a broadcast data network
EP2493191B1 (en) Method, device and system for realizing hierarchically requesting content in http streaming system
US20130208888A1 (en) Managing content distribution in a wireless communications environment
CN105095490A (en) Target image searching method, terminal and system
US9143552B2 (en) Dynamic profile management based on transport detection
WO2013098468A1 (en) Method and apparatus for flexible caching of delivered media
CN104137564A (en) Controlled streaming of segmented content
KR20180123500A (en) Signaling of application content packaging and delivery
WO2010083214A2 (en) Content set based deltacasting
CN105117455A (en) Along-the-road target image search method, terminal and system
US8566245B2 (en) Generating and using network data to provide a content customization service
US8817983B2 (en) Streaming video to cellular phones
US20120278448A1 (en) Method and System for Accessing Microblog, and Method and System for Sending Pictures on Microblog Website
CN102474517A (en) A method of switching media content for a mobile apparatus
WO2011017976A1 (en) Method and system for media service
CN102404616B (en) Method and system for pushing data cloud based on digital television network
US8661473B2 (en) Apparatus and method for providing contents in internet broadcasting system
US7873703B2 (en) Method and apparatus for broadcasting information
CN102640515A (en) Method and set top box for acquiring program content
CN104065678B (en) A kind of dual mode data based on built-in browser is received and access method
US20090193462A1 (en) Apparatus and method for transmitting/receiving electronic service guide in digital video broadcasting system

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200780018990.X

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07728124

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2009517052

Country of ref document: JP

Ref document number: 2007728124

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

NENP Non-entry into the national phase

Ref country code: RU