US20140150021A1 - System and method for seamless content insertion on network content using audio-video fingerprinting and watermarking - Google Patents

System and method for seamless content insertion on network content using audio-video fingerprinting and watermarking Download PDF

Info

Publication number
US20140150021A1
US20140150021A1 US14/131,504 US201214131504A US2014150021A1 US 20140150021 A1 US20140150021 A1 US 20140150021A1 US 201214131504 A US201214131504 A US 201214131504A US 2014150021 A1 US2014150021 A1 US 2014150021A1
Authority
US
United States
Prior art keywords
content
network
amagi
assets
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US14/131,504
Inventor
Baskar Subramanian
Shashidhar Banger
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to US14/131,504 priority Critical patent/US20140150021A1/en
Publication of US20140150021A1 publication Critical patent/US20140150021A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing
    • G06T1/0021Image watermarking
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/462Content or additional data management, e.g. creating a master electronic program guide from data received from the Internet and a Head-end, controlling the complexity of a video stream by scaling the resolution or bit-rate based on the client capabilities
    • H04N21/4622Retrieving content or additional data from different sources, e.g. from a broadcast channel and the Internet
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • H04N21/812Monomedia components thereof involving advertisement data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/835Generation of protective data, e.g. certificates
    • H04N21/8358Generation of protective data, e.g. certificates involving watermark

Definitions

  • the present invention relates to a System and method for seamless content insertion on transmitted network content using audio-video fingerprinting and watermarking.
  • Local content insertion on network content is accomplished to address targeted ad insertion or local content in cable Multiple System Operator (MSO) headends, set-top boxes and Internet browsers and in Internet content delivery networks.
  • MSO Multiple System Operator
  • One application of local content insertion is targeted advertising wherein advertisements are placed to appeal to consumers or potential consumers, who are profiled based on demographics, besides a number of other variables.
  • Advances in digital transmission technology enable new and improved methods for targeted advertising where advertisers can further verify the effectiveness of the ads and refine strategies utilizing quick feedback loops.
  • Traditional broadcasting models where the same advertisement or content is seen by all users are slowly being replaced by local content insertion, to increase the spectrum of local advertisements.
  • VOD video on demand
  • SDV switched digital video
  • the network content that is streamed currently provides an out-of-band identification marker that determines the point of insertion that guarantees seamless local content insertion.
  • This marker could be a Dual Tone Multi Frequency (DTMF) cue tone, an SCTE-35 message or an Adobe Flash ad insertion trigger.
  • DTMF Dual Tone Multi Frequency
  • SCTE-35 SCTE-35
  • Adobe Flash ad insertion trigger This marker is sent over the network from the source of transmission in an out-of-band manner synchronized with the point of insertion on the audio-video content.
  • WO 2006097825 discloses a System and Method for household-targeted advertising wherein as Set Top Box (essentially housed at a consumer's premises) has targeted ads delivered to it, wherein the system is specifically programmed for an IP stack.
  • customised advertisements are delivered to a user by pulling on demand from a media storage device using SCTE-35 cues.
  • US2006075449 discloses a Distributed architecture for digital program insertion in video streams delivered over packet networks wherein a head-end unit inserts Internet Protocol (IP) splice points into a digital video transport stream, which is embedded with cue tone signals.
  • IP Internet Protocol
  • US2005015816 discloses a System and method of providing triggered event commands via digital program insertion splicing wherein a DVS380 compliant message which is extracted to determine the point at which local content is inserted.
  • the present invention eliminates the need for any intervention on the transmission source for identifying the point of insertion. This is accomplished by using an in-band marker on the network content (also referred to as a network content asset) to be replaced, instead of out-of-band marker.
  • This in-band marker is a combination of a watermark embedded into the content and fingerprints extracted from the content.
  • the watermark that is embedded can be composed of audio or video or some combination of the two.
  • the present invention proposes a system and method for seamless content insertion on network content by utilizing an in-band marker which is a combination of a watermark and a fingerprint that are used in conjunction to identify when and where to modify the main audio/video content.
  • an in-band marker which is a combination of a watermark and a fingerprint that are used in conjunction to identify when and where to modify the main audio/video content.
  • One of the applications of this invention is targeted for advertising where local ads are placed to cater to custmoers from different demographics.
  • a watermark which is essentially a steganographic code
  • the present invention uniquely identifies content assets.
  • a fingerprint is extracted for every content asset and further used to extract the salient features of the content, with high confidence and identify the exact point where the local content is to be inserted.
  • the following terms are used in this specification:
  • the present invention proposes a method and system for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets wherein a point of insertion is identified using an in-band marker constructed by a combination of an audio-video watermark and a unique fingerprint, used to customize content transmitted over one or more varied delivery networks, comprising the steps of:
  • the present invention removes the limitation of dependence on the head-end within the transmission ecosystem. Furthermore, the method and system of the present invention allows high-speed fingerprint extraction and matching, in order to minimize the time taken to provide a high-confidence, valid match to the point of insertion on both audio and video.
  • the system does not impose specific fingerprinting or watermarking methods. There are multiple methods in public literature for fingerprinting [2] and for watermarking [3] that can be used as part of the proposed system.
  • FIG. 1 illustrates the system of existing TV transmission, supporting local content insertion.
  • FIG. 2 illustrates the system of TV reception supporting local content insertion.
  • FIG. 3 illustrates AMAGI-BARCODE-IN module.
  • FIG. 3 a illustrates a TV transmission system with proposed method supporting local content insertion alongside the AMAGI-BARCODE-IN module.
  • FIG. 4 illustrates AMAGI-BARCODE-OUT module.
  • FIG. 4 a illustrates steps performed in the AMAGI-BARCODE-OUT module.
  • FIG. 4 b illustrates a TV reception system with proposed method supporting local content insertion alongside the AMAGI-BARCODE-OUT module.
  • FIG. 4 c illustrates the hardware modules within the AMAGI-BARCODE-OUT module.
  • FIG. 1 A typical TV network transmission that enables content insertion is described in FIG. 1 .
  • Entertainment content [ 101 ] and ad content playout servers [ 102 ] play their respective content, based on a playout schedule determined by the T&B (traffic and billing) systems [ 103 ].
  • This content is passed through an audio-video encoder [ 104 ] and a multiplexer [ 105 ] before being up-linked onto a satellite [ 106 ], to be down-linked and distributed to subscriber homes.
  • T&B systems know the exact playout duration and sequence of every content asset. For accomplishing local content insertion, the T&B system is programmed so as to initiate a trigger to identify the exact point of insertion based on playout schedule.
  • This trigger is then encoded in time-sync with the content [ 107 ] and sent over the satellite.
  • an analog transmission it used to be a single audio tone.
  • a DTMF (dual-tone multi-frequency) cue tones or SCTE-35 markers are used.
  • the trigger from the T&B system [ 108 ] can either be a simple input trigger (GPI—General Purpose Input, programmed as a trigger), or could be a data-enabled message like SCTE-104.
  • GPI General Purpose Input, programmed as a trigger
  • SCTE-104 standard allows for providing additional information related to the to be replaced content asset, its duration and any other information that the T&B system can communicate with the local content insertion system post the down-link of content.
  • FIG. 2 A typical TV transmission downlink in a cable headend plant is shown in FIG. 2 .
  • the content is down-linked [ 201 ] and passed through a receiver [ 202 ], which decrypts the content (if encrypted at transmission source) and optionally decoded to be processed by the cable or satellite TV plant.
  • These receivers have features to extract a plurality of triggers sent from the transmission that is received time synchronized with the point of content insertion.
  • the content [ 203 ] along with the trigger [ 204 ] is fed to additional systems that either do the local content insertion [ 205 ] or translate the trigger to be sent over the cable or satellite TV plant, time synchronized with the content, to edge devices where local content insertion is accomplished.
  • TV networks have to identify the specific ad content asset that the advertisers want to replace with local ad content. This is done in their T&B systems, where the intention of the advertiser is translated to a trigger to be sent to the encoders by the T&B system.
  • the present invention introduces two modules, AMAGI-BARCODE-IN that is exercised prior to content transmission, and AMAGI-BARCODE-OUT which is used either at the point of local content insertion or at a point in the transmission chain where the content and the point of insertion needs to be ascertained.
  • Multiple video frames and audio samples within the content stream are analyzed in order to create high confidence during extraction. Since the present invention works in real-Time and has to process the watermarks and fingerprints quickly, the challenge of creating a method that can ascertain content to be replaced with high-confidence alongside having a robust system, which can work interoperably with existing television transmission systems has to be overcome.
  • the AMAGI-BARCODE-IN module is described in FIG. 3 .
  • the content asset [ 301 ] that is expected to be replaced with local content is sent through the AMAGI-BARCODE-IN module.
  • the AMAGI-BARCODE-IN [ 302 ] module embeds a unique content asset identifier (AMAGI-ID) as an audio-video watermark into the content asset.
  • AMAGI-ID unique content asset identifier
  • An audio-video watermark is a mechanism of embedding information onto the audio-video content, using the content as the carrier channel.
  • the key to a good watermark is that the embedding of information effects a minimal distortion in the audio, video of the content, which is invisible and inaudible to humans.
  • the module extracts a unique audio-video fingerprint (called AMAGI-FP) from the content asset and stores the same in a data store, AMAGI-ID-DB [ 303 ].
  • AMAGI-FP a unique audio-video fingerprint
  • Audio-video fingerprint is a form of low complexity compression of audio and video for purpose of unique identification of the content, rather than near-exact reproduction.
  • the fingerprinting method should be able to at low complexity extract unique features of audio and video into a pattern, which during reception can be extracted from the network content for matching with the already extracted pattern.
  • the content is watermarked with AMAGI-ID, the watermarked content being called AMAGI-WMED-CONTENT [ 304 ].
  • AMAGI-WMED-CONTENT [ 304 ] that is out of the AMAGI-BARCODE-IN module is further sent for transmission on a plurality of networks, in a plurality of different content formats.
  • FIG. 3 a describes sample TV transmission system with the proposed methods as part of the system.
  • Content asset that is to be replaced is passed through AMAGI-BARCODE-IN [ 501 ] to create AMAGI-WMED-CONTENT.
  • This is then passed on to a TV network's content data store [ 502 ] in a plurality of content formats as needed by the TV network.
  • T&B system [ 503 ] then schedules the playout of different content from its playout servers [ 504 ], including AMAGI-WMED-CONTENT based on a schedule.
  • AMAGI-WMED-CONTENT is treated as any other regular content going over the transmission stream.
  • FIG. 4 shows the AMAGI-BARCODE-OUT module [ 401 ] takes as input the transmitted content stream [ 402 ]. The module looks for the embedded watermark in select content on the transmitted content stream. If the watermark is found, the AMAGI-BARCODE-OUT module extracts AMAGI-ID from the content. The AMAGI-BARCODE-OUT module [ 401 ] then extracts audio-video fingerprints from the transmitted content stream and tries to match these fingerprints with plurality of AMAGI-FPs of different content assets stored in AMAGI-ID-DB [ 403 ]. We note that this data store was initially populated by the AMAGI-BARCODE-IN module.
  • a content asset's fingerprint is matched with one of the AMAGI-FPs (from the AMAGI-ID-DB [ 403 ]), that fingerprint is used along with the content-asset's watermark to identify the position of insertion of local content.
  • the module outputs AMAGI-ID [ 404 ] (uniquely identifying the content asset to be replaced) and the specific point of insertion [ 405 ] at specific points in the transmitted content stream, where the content asset is found in the stream.
  • FIG. 4 a shows the sequence of steps performed in the AMAGI-BARCODE-OUT module as it works on parsing [ 702 ] the transmitted network content [ 701 ] until [ 704 ] a valid watermark is found [ 702 ].
  • this module identifies it as a valid watermark [ 703 ], else no action is taken and it continues [ 704 ] to analyze the incoming stream.
  • the watermark is extracted from both audio and video to identify with high confidence the existence of AMAGI-ID watermark. In another possible embodiment, the watermark is extracted from either one of audio or video.
  • the step of fingerprint extraction and matching [ 706 ] is executed to identify the point of insertion of audio and video. This is done by running the steps of matching across a set duration of content from the point of watermark identification. In one embodiment, the fingerprint matching is exercised across say ⁇ 5 to +10 video frames, from the position of identification of the watermark. This provides the exact position of the point of insertion [ 707 ].
  • the module shall run two different fingerprint matching methods to identify different audio and video insertion points. In another possible embodiment, the module shall execute either one of audio or video fingerprint matching and use the point of insertion identified for either audio or video to estimate the point of insertion for both audio and video.
  • an audio-video delay element post the point of content analysis by AMAGI-BARCODE-OUT.
  • the duration of delay in the transmission chain is determined by the maximum of, (a) duration needed to extract the audio-video watermark and (b) duration needed to match extracted audio-video signature.
  • the specific duration for each of these is determined by the specific watermarking and fingerprint matching methods that are implemented as part of the system. Watermarking methods involves either using spread-spectrum, amplitude modulation or using quantization of signals. Fingerprinting methods use perceptual characteristics of the audio or video to extract a unique bit pattern that at best uniquely identifies the content.
  • FIG. 4 b illustrates a TV reception system with proposed method supporting local content insertion alongside the AMAGI-BARCODE-OUT module.
  • the audio-video content from the receiver [ 601 ] is passed to AMAGI-BARCODE-OUT module [ 602 ] as well as to an audio-video delay module [ 603 ].
  • the audio-video delay module is configured to delay the content stream to the extent needed by AMAGI-BARCODE-OUT to identify the content asset with the AMAGI-ID on the content stream as well as the specific point of insertion of local content.
  • This delayed content stream is then sent to additional systems that either do the local content insertion [ 604 ] or translate the trigger to be sent over the cable or satellite TV plant, time synchronized with the content, to edge devices where local content insertion is accomplished.
  • the AMAGI-BARCODE-OUT module as described in FIG. 4 c is a specialized hardware module that hosts the methods for watermarking, fingerprinting and barcode matching methods.
  • This hardware shall have interfaces to receive audio-video stream [ 801 ], signal processor for executing watermarking and fingerprint methods [ 802 ], and a general purpose processor for handling logic to match the extracted barcode with the stored candidate barcodes [ 803 ].
  • the AMAGI-BARCODE-OUT hardware supports pluraility of interfaces like SDI, HDMI, composite video and audio for receiving audio-video content stream in real-time.
  • All modules specified in this system are either standalone devices or can be part of another entity.
  • the AMAGI-BARCODE-OUT module, the audio-video delay module and the local content insertion system can be part of one entity. Those skilled in this art can come up with multiple different module configurations to accomplish the same.
  • viewer-specific local content insertion can be done on streamed content that replace a generic content asset inserted during content insertion.
  • the system allows for using AMAGI-BARCODE-OUT to identify the content asset and the point of insertion during the stored content playout, where during playout, based on the context, different content insertion can be accomplished at the point of insertion.

Abstract

System and method is proposed to extract audio-video fingerprint and embed an audio-video watermark onto a content asset that is transmitted as part of network content. It is proposed to identify uniquely the specific content to be replaced and the point of local content insertion by looking for the embedded audio-video watermark and match the audio-video fingerprint. The identified point of insertion enables seamless local content insertion.

Description

    FIELD OF THE INVENTION
  • The present invention relates to a System and method for seamless content insertion on transmitted network content using audio-video fingerprinting and watermarking.
  • BACKGROUND
  • Local content insertion on network content is accomplished to address targeted ad insertion or local content in cable Multiple System Operator (MSO) headends, set-top boxes and Internet browsers and in Internet content delivery networks. One application of local content insertion is targeted advertising wherein advertisements are placed to appeal to consumers or potential consumers, who are profiled based on demographics, besides a number of other variables. Advances in digital transmission technology enable new and improved methods for targeted advertising where advertisers can further verify the effectiveness of the ads and refine strategies utilizing quick feedback loops. Traditional broadcasting models where the same advertisement or content is seen by all users are slowly being replaced by local content insertion, to increase the spectrum of local advertisements. Amongst the first few advances in technology, video on demand (VOD) and switched digital video (SDV) enabled advertisers to customize content for specfic groups of viewers. Introduction of feedback loops from the viewer's end, back to the headend, further enabled point-wise monitoring of user-preferences. Several standards have also been introduced during this evolution including that from the Society of Cable Telecommunications Engineers, which define various standards for cable telecomm transmission systems. Advertising content is also changing simultaneously to include data tracks, over and above audio and video tracks in MPEG format [1].
  • PRIOR ART
  • In present-day systems, in order to identify the point of insertion, the network content that is streamed currently provides an out-of-band identification marker that determines the point of insertion that guarantees seamless local content insertion. This marker could be a Dual Tone Multi Frequency (DTMF) cue tone, an SCTE-35 message or an Adobe Flash ad insertion trigger. This marker is sent over the network from the source of transmission in an out-of-band manner synchronized with the point of insertion on the audio-video content. In these current systems there is a need to identify each point of insertion at the transmission source to identify and send relevant markers for the identified point of insertion. WO 2006097825 discloses a System and Method for household-targeted advertising wherein as Set Top Box (essentially housed at a consumer's premises) has targeted ads delivered to it, wherein the system is specifically programmed for an IP stack. In this patent, customised advertisements are delivered to a user by pulling on demand from a media storage device using SCTE-35 cues. US2006075449 discloses a Distributed architecture for digital program insertion in video streams delivered over packet networks wherein a head-end unit inserts Internet Protocol (IP) splice points into a digital video transport stream, which is embedded with cue tone signals. By using a splicing device downstream, this invention later inserts a specific ad at the splice points inserted by the head-end to customise the content based on demographic information. US2005015816 discloses a System and method of providing triggered event commands via digital program insertion splicing wherein a DVS380 compliant message which is extracted to determine the point at which local content is inserted.
  • In these works of prior art, a complex workflow is implicit when there are hundreds of advertising content planned for local replacements across different points of insertion. In addition, the replacement of local content is predicated by the need for the marker, in several incompatible formats across transmission systems, to be inserted by the transmission source, which entails the need for the transmission business to participate in this ecosystem of local content insertion. This requirement renders present day systems top-heavy and unrealistic in the long term besides demonstrating no potential for increasing the spectrum of local ads as the methods to insert them seamlessly do not exist.
  • There are systems available in the market that uses audio-video fingerprinting only to effect local content insertion like DVEO's Gen2 ad insertion system and patent applications like “Video Detection and Insertion ”, by Konig; Richard; (US); et al. The key problem with these systems is the ability to uniquely identify content assets to be replaced. Given that there can be multiple content assets with long duration of content remaining the same, to identify unique assets using fingerprints warrants capturing and matching the fingerprints for duration longer than the anticipated duration of same audio-video content, in non-unique content assets. This leads to a limitation of the system to anticipate and design the workflow for the worst-case duration of same audio-video on non-unique content assets.
  • The present invention eliminates the need for any intervention on the transmission source for identifying the point of insertion. This is accomplished by using an in-band marker on the network content (also referred to as a network content asset) to be replaced, instead of out-of-band marker. This in-band marker is a combination of a watermark embedded into the content and fingerprints extracted from the content. The watermark that is embedded can be composed of audio or video or some combination of the two. At the point of local insertion, where a local content asset replaces the network content asset, the method and system of the present invention are able to uniquely identify the network content asset to be replaced and identify the specific point of insertion that guarantees seamless local content insertion, in real-time.
  • SUMMARY
  • The present invention proposes a system and method for seamless content insertion on network content by utilizing an in-band marker which is a combination of a watermark and a fingerprint that are used in conjunction to identify when and where to modify the main audio/video content. One of the applications of this invention is targeted for advertising where local ads are placed to cater to custmoers from different demographics. By inserting a watermark, which is essentially a steganographic code, the present invention uniquely identifies content assets. A fingerprint is extracted for every content asset and further used to extract the salient features of the content, with high confidence and identify the exact point where the local content is to be inserted. The following terms are used in this specification:
      • 1. Network Content Asset—This refers to the original content that was being transmitted or broadcast over a delivery network.
      • 2. Local Content Asset—This refers to the customized content (for example, a local language advertisement to be transmitted during the break in the 7 pm news in one language in one state and another language in another state).
      • 3. “AMAGI-ID”—This refers to the unique audio-video watermark that is inserted into the network content asset.
      • 4. “AMAGI-WMED-CONTENT”—This refers to the watermarked network content asset.
      • 5. “AMAGI-FP”—This refers to the fingerprint that is extracted from the network content asset.
      • 6. “AMAGI-ID-DB”—This refers to a data store used in the present invention.
      • 7. “AMAGI-BARCODE-IN”—This refers to a module that is introduced in the present invention, prior to content transmission.
      • 8. “AMAGI-BARCODE-OUT”—This refers to a module that is introduced in the present invention, at the point of local content insertion or at a point in the transmission chain where the content and the point of insertion needs to be ascertained.
  • The present invention proposes a method and system for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets wherein a point of insertion is identified using an in-band marker constructed by a combination of an audio-video watermark and a unique fingerprint, used to customize content transmitted over one or more varied delivery networks, comprising the steps of:
      • a. Embedding a unique network content asset identifier “AMAGI-ID” as an audio-video watermark, into the network content asset, the watermarked network content asset being labeled “AMAGI-WMED-CONTENT”;
      • b. Extracting a unique audio-video fingerprint “AMAGI-FP” from the network content asset;
      • c. Storing “AMAGI-ID” and “AMAGI-FP” in a data store “AMAGI-ID-DB”;
      • d. Transmitting “AMAGI-WMED-CONTENT” over one or more networks in one or more content formats;
      • e. Introducing a delay in the delivery of the network in order to identify the watermark and the fingerprint during network reception;
      • f. Monitoring the network/stored playout content assets at the time of reception to detect “AMAGI-WMED-CONTENT”;
      • g. Upon detecting previously inserted watermarks (in step a) in “AMAGI-WMED-CONTENT”:
        • i. Extracting the corresponding “AMAGI-ID” from the network content asset;
        • ii. Extracting corresponding fingerprints for the network content asset;
        • iii. Matching the extracted fingerprints obtained in step g(ii) with one or more previously stored “AMAGI-FP” in the “AMAGI-ID-DB”;
        • iv. Upon finding a match in step g(iii):
          • a) Delivering the local content asset at the correct position in the stream of network/stored playout content assets, thereby replacing the network content asset with a suitable local content asset.
  • The present invention removes the limitation of dependence on the head-end within the transmission ecosystem. Furthermore, the method and system of the present invention allows high-speed fingerprint extraction and matching, in order to minimize the time taken to provide a high-confidence, valid match to the point of insertion on both audio and video. The system does not impose specific fingerprinting or watermarking methods. There are multiple methods in public literature for fingerprinting [2] and for watermarking [3] that can be used as part of the proposed system.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 illustrates the system of existing TV transmission, supporting local content insertion.
  • FIG. 2 illustrates the system of TV reception supporting local content insertion.
  • FIG. 3 illustrates AMAGI-BARCODE-IN module.
  • FIG. 3 a illustrates a TV transmission system with proposed method supporting local content insertion alongside the AMAGI-BARCODE-IN module.
  • FIG. 4 illustrates AMAGI-BARCODE-OUT module.
  • FIG. 4 a illustrates steps performed in the AMAGI-BARCODE-OUT module.
  • FIG. 4 b illustrates a TV reception system with proposed method supporting local content insertion alongside the AMAGI-BARCODE-OUT module.
  • FIG. 4 c illustrates the hardware modules within the AMAGI-BARCODE-OUT module.
  • DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
  • A typical TV network transmission that enables content insertion is described in FIG. 1. Entertainment content [101] and ad content playout servers [102] play their respective content, based on a playout schedule determined by the T&B (traffic and billing) systems [103]. This content is passed through an audio-video encoder [104] and a multiplexer [105] before being up-linked onto a satellite [106], to be down-linked and distributed to subscriber homes. T&B systems know the exact playout duration and sequence of every content asset. For accomplishing local content insertion, the T&B system is programmed so as to initiate a trigger to identify the exact point of insertion based on playout schedule. This trigger is then encoded in time-sync with the content [107] and sent over the satellite. In the case of an analog transmission, it used to be a single audio tone. In case of digital content transmission, a plurality of options, either a DTMF (dual-tone multi-frequency) cue tones or SCTE-35 markers are used. The trigger from the T&B system [108] can either be a simple input trigger (GPI—General Purpose Input, programmed as a trigger), or could be a data-enabled message like SCTE-104. In a simple GPI, the position of insertion is identified, but no information is transmitted related to the content asset that is to be replaced. SCTE-104 standard allows for providing additional information related to the to be replaced content asset, its duration and any other information that the T&B system can communicate with the local content insertion system post the down-link of content.
  • A typical TV transmission downlink in a cable headend plant is shown in FIG. 2. At the cable or satellite TV headend, the content is down-linked [201] and passed through a receiver [202], which decrypts the content (if encrypted at transmission source) and optionally decoded to be processed by the cable or satellite TV plant. These receivers have features to extract a plurality of triggers sent from the transmission that is received time synchronized with the point of content insertion. The content [203] along with the trigger [204] is fed to additional systems that either do the local content insertion [205] or translate the trigger to be sent over the cable or satellite TV plant, time synchronized with the content, to edge devices where local content insertion is accomplished. To enable local ad content insertion in a TV transmission system, TV networks have to identify the specific ad content asset that the advertisers want to replace with local ad content. This is done in their T&B systems, where the intention of the advertiser is translated to a trigger to be sent to the encoders by the T&B system.
  • The present invention introduces two modules, AMAGI-BARCODE-IN that is exercised prior to content transmission, and AMAGI-BARCODE-OUT which is used either at the point of local content insertion or at a point in the transmission chain where the content and the point of insertion needs to be ascertained. Multiple video frames and audio samples within the content stream are analyzed in order to create high confidence during extraction. Since the present invention works in real-Time and has to process the watermarks and fingerprints quickly, the challenge of creating a method that can ascertain content to be replaced with high-confidence alongside having a robust system, which can work interoperably with existing television transmission systems has to be overcome.
  • The AMAGI-BARCODE-IN module is described in FIG. 3. The content asset [301] that is expected to be replaced with local content is sent through the AMAGI-BARCODE-IN module. The AMAGI-BARCODE-IN [302] module embeds a unique content asset identifier (AMAGI-ID) as an audio-video watermark into the content asset. An audio-video watermark is a mechanism of embedding information onto the audio-video content, using the content as the carrier channel. The key to a good watermark is that the embedding of information effects a minimal distortion in the audio, video of the content, which is invisible and inaudible to humans. In addition the information should survive different network transmission systems and compression/decompression of content and be able to reliably extract at the receiver end. In addition the module extracts a unique audio-video fingerprint (called AMAGI-FP) from the content asset and stores the same in a data store, AMAGI-ID-DB [303].
  • Audio-video fingerprint is a form of low complexity compression of audio and video for purpose of unique identification of the content, rather than near-exact reproduction. The fingerprinting method should be able to at low complexity extract unique features of audio and video into a pattern, which during reception can be extracted from the network content for matching with the already extracted pattern.
  • The content is watermarked with AMAGI-ID, the watermarked content being called AMAGI-WMED-CONTENT [304]. AMAGI-WMED-CONTENT [304] that is out of the AMAGI-BARCODE-IN module is further sent for transmission on a plurality of networks, in a plurality of different content formats.
  • FIG. 3 a describes sample TV transmission system with the proposed methods as part of the system. Content asset that is to be replaced is passed through AMAGI-BARCODE-IN [501] to create AMAGI-WMED-CONTENT. This is then passed on to a TV network's content data store [502] in a plurality of content formats as needed by the TV network. T&B system [503] then schedules the playout of different content from its playout servers [504], including AMAGI-WMED-CONTENT based on a schedule. In the proposed method, there is no need for the T&B system to be aware of the existence of content to be replaced and hence the absence of need for identifying point of local content insertion in the transmitted content stream. AMAGI-WMED-CONTENT is treated as any other regular content going over the transmission stream.
  • FIG. 4 shows the AMAGI-BARCODE-OUT module [401] takes as input the transmitted content stream [402]. The module looks for the embedded watermark in select content on the transmitted content stream. If the watermark is found, the AMAGI-BARCODE-OUT module extracts AMAGI-ID from the content. The AMAGI-BARCODE-OUT module [401] then extracts audio-video fingerprints from the transmitted content stream and tries to match these fingerprints with plurality of AMAGI-FPs of different content assets stored in AMAGI-ID-DB [403]. We note that this data store was initially populated by the AMAGI-BARCODE-IN module. If a content asset's fingerprint is matched with one of the AMAGI-FPs (from the AMAGI-ID-DB [403]), that fingerprint is used along with the content-asset's watermark to identify the position of insertion of local content. The module outputs AMAGI-ID [404] (uniquely identifying the content asset to be replaced) and the specific point of insertion [405] at specific points in the transmitted content stream, where the content asset is found in the stream.
  • FIG. 4 a shows the sequence of steps performed in the AMAGI-BARCODE-OUT module as it works on parsing [702] the transmitted network content [701] until [704] a valid watermark is found [702]. In the case of AMAGI-WMED-CONTENT coming through the network content stream, this module identifies it as a valid watermark [703], else no action is taken and it continues [704] to analyze the incoming stream. Once a valid watermark is found, it is extracted and returned as AMAGI-ID [705], the unique content asset identifier that was embedded as part of AMAGI-BARCODE-IN module.
  • In one possible embodiment, the watermark is extracted from both audio and video to identify with high confidence the existence of AMAGI-ID watermark. In another possible embodiment, the watermark is extracted from either one of audio or video. Once the availability of AMAGI-WMED-CONTENT is ascertained, the step of fingerprint extraction and matching [706] is executed to identify the point of insertion of audio and video. This is done by running the steps of matching across a set duration of content from the point of watermark identification. In one embodiment, the fingerprint matching is exercised across say −5 to +10 video frames, from the position of identification of the watermark. This provides the exact position of the point of insertion [707]. In another possible embodiment, the module shall run two different fingerprint matching methods to identify different audio and video insertion points. In another possible embodiment, the module shall execute either one of audio or video fingerprint matching and use the point of insertion identified for either audio or video to estimate the point of insertion for both audio and video.
  • Given that the system works on analyzing audio-video content during AMAGI-BARCODE-OUT, it becomes imperative that the overall transmitted content is delayed at the point of analysis or further down the content flow, so as to enable specific insertion action based on analyzed content. In systems implementing the present invention, there shall be an audio-video delay element post the point of content analysis by AMAGI-BARCODE-OUT. The duration of delay in the transmission chain is determined by the maximum of, (a) duration needed to extract the audio-video watermark and (b) duration needed to match extracted audio-video signature. The specific duration for each of these is determined by the specific watermarking and fingerprint matching methods that are implemented as part of the system. Watermarking methods involves either using spread-spectrum, amplitude modulation or using quantization of signals. Fingerprinting methods use perceptual characteristics of the audio or video to extract a unique bit pattern that at best uniquely identifies the content.
  • FIG. 4 b illustrates a TV reception system with proposed method supporting local content insertion alongside the AMAGI-BARCODE-OUT module. In this system, the audio-video content from the receiver [601] is passed to AMAGI-BARCODE-OUT module [602] as well as to an audio-video delay module [603]. The audio-video delay module is configured to delay the content stream to the extent needed by AMAGI-BARCODE-OUT to identify the content asset with the AMAGI-ID on the content stream as well as the specific point of insertion of local content. This delayed content stream is then sent to additional systems that either do the local content insertion [604] or translate the trigger to be sent over the cable or satellite TV plant, time synchronized with the content, to edge devices where local content insertion is accomplished.
  • The AMAGI-BARCODE-OUT module as described in FIG. 4 c is a specialized hardware module that hosts the methods for watermarking, fingerprinting and barcode matching methods. This hardware shall have interfaces to receive audio-video stream [801], signal processor for executing watermarking and fingerprint methods [802], and a general purpose processor for handling logic to match the extracted barcode with the stored candidate barcodes [803]. The AMAGI-BARCODE-OUT hardware supports pluraility of interfaces like SDI, HDMI, composite video and audio for receiving audio-video content stream in real-time.
  • It is to be noted that specific methods for (a) embedding and extracting of audio-video watermark and (b) extraction and matching of audio-video fingerprints are outside the scope of this invention and has no direct bearing on the proposed system. It is assumed that those skilled in this art shall be able to leverage a plurality of different methods to accomplish the same desired effect. It also to be noted that the proposed system is no way limited to watermarking and fingerprinting to be done on both audio and video. It is possible for the system to be implemented with a plurality of either audio, video or with both audio and video processing for watermarking and fingerprint matching.
  • All modules specified in this system are either standalone devices or can be part of another entity. In one possible implementation, the AMAGI-BARCODE-OUT module, the audio-video delay module and the local content insertion system can be part of one entity. Those skilled in this art can come up with multiple different module configurations to accomplish the same.
  • Those skilled in this art shall be able to appreciate the spirit of this invention and its application to one or more varied system implementations and transmission networks, including but not limited to TV content transmission. The system could also be used for content which is either only audio or only video or on different tracks of audio in multi-track audio-video content. This invention can also be applied to any of the other content transmission networks including the Internet and on stored content. In an Internet content transmission network, by using AMAGI-BARCODE-IN method on selected content assets during content creation, the system allows for using AMAGI-BARCODE-OUT to identify the content asset and the point of insertion during unicast or multicast streaming to viewers. Using the approach, viewer-specific local content insertion can be done on streamed content that replace a generic content asset inserted during content insertion. In a stored content, by using AMAGI-BARCODE-IN method on selected content assets during content creation, the system allows for using AMAGI-BARCODE-OUT to identify the content asset and the point of insertion during the stored content playout, where during playout, based on the context, different content insertion can be accomplished at the point of insertion.
  • REFERENCES
    • 1. http://www.cable360.net/ct/strategy/businesscases/Adaptive-Advertising-Definitions-Scenarios-and-Standards21347.html
    • 2. “A Highly Robust Audio Fingerprinting System”, by Jaap Haitsma et al
    • 3. “A Video Watermarking System for Broadcast Monitoring”, by Jaap Haitsma et al

Claims (30)

1. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets, wherein a point of insertion is identified using an in-band marker constructed by a combination of an audio-video watermark and a unique fingerprint, used to customize content transmitted over one or more varied delivery networks, comprising the steps of:
a. Embedding a unique network content asset identifier “AMAGI-ID” as an audio-video watermark, into the network content asset, the watermarked network content asset being labeled “AMAGI-WMED-CONTENT”;
b. Extracting a unique audio-video fingerprint “AMAGI-FP” from the network content asset;
c. Storing “AMAGI-ID” and “AMAGI-FP” in a data store “AMAGI-ID-DB”;
d. Transmitting “AMAGI-WMED-CONTENT” over one or more networks in one or more content formats;
e. Introducing a delay in the delivery of the network in order to identify the watermark and the fingerprint during network reception;
f. Monitoring the network/stored playout content assets at the time of reception to detect “AMAGI-WMED-CONTENT”;
g. Upon detecting previously inserted watermarks (in step a) in “AMAGI-WMED-CONTENT”:
i. Extracting the corresponding “AMAGI-ID” from the network content asset;
ii. Extracting corresponding fingerprints for the network content asset; and
iii. Matching the extracted fingerprints obtained in step g(ii) with one or more previously stored “AMAGI-FP” in the “AMAGI-ID-DB”;
h. Upon finding a match in step g(iii):
b) Delivering the local content asset at the correct position in the stream of network/stored playout content assets, thereby replacing the network content asset with a suitable local content asset.
2. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 1 wherein:
a. One or more watermarking methods using spread-spectrum, amplitude modulation or using quantization of signals, can be used;
b. The watermark uses the network content asset as the carrier channel;
c. The watermark effects minimal distortion in the audio and video within the network content asset; and
d. The watermark is independent of the delivery networks, which might have different compression and/or decompression of the audio-video content, and yet the watermark is reliably transmitted.
3. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 1 wherein:
a. One or more fingerprinting methods using perceptual characteristics of audio or video streams within one or more network/stored playout content assets, can be used to extract a unique bit pattern that at best uniquely identifies the network/stored playout content assets; and
b. Fingerprinting methods used utilize a hash function or a low complexity compression of audio and or video for the purpose of unique identification of the content, rather than near-exact reproduction;
4. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 1 wherein if the fingerprint extracted in step g(ii) match with a unique “AMAGI-FP” in “AMAGI-ID-DB”, it is used along with the previously inserted watermark detected to identify the position of insertion of local content assets in the stream of network/stored playout content assets and the steps of watermarking and fingerprinting can be performed interchangably.
5. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 1 wherein the delay introduced in step 1(e) is determined as the maximum value between:
a. The duration elapsing during extraction of the audio-video watermark; and
b. The duration elapsing during matching the extracted fingerprint.
6. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 1 wherein:
a. An “AMAGI-BARCODE-IN” module performs steps a-d; and
b. An “AMAGI-BARCODE-OUT” module performs steps f and g.
7. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 1 wherein the watermark is extracted from both audio and video of the network content asset or from either one of audio or video to identify the existence of “AMAGI-ID” watermark.
8. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 1 wherein the fingerprint matching is exercised across −5 to +10 video frames, from the position of identification of the watermark in order to obtain the exact position of the point of insertion.
9. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 1 wherein one or more fingerprint matching procedures to identify different audio and video insertion points.
10. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 1 wherein either one of audio or video fingerprint matching and use the point of insertion identified for either audio or video to estimate the point of insertion for both audio and video.
11. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 1 used in cable head-ends, set-top boxes, internet browsers and internet content delivery networks using audio-video fingerprinting and water marking with (a) “AMAGI-BARCODE-IN” module, (b) an entertainment content play out server, (c) an ad content play out server, (d) traffic-billing system, (e) one or more encoder or decoder, (f) “AMAGI-BARCODE-OUT” module and (g) cable and satellite TV plants configured to support a plurality of networks, in a plurality of content formats, further comprising the steps of:
a. Performing in the “AMAGI-BARCODE-IN” module the steps of:
i. Embedding a unique network content asset identifier “AMAGI-ID” as an audio-video watermark, into the network content asset, the watermarked network content asset being labeled “AMAGI-WMED-CONTENT”;
ii. Extracting a unique audio-video fingerprint “AMAGI-FP” from the network content asset;
iii. Storing “AMAGI-ID” and “AMAGI-FP” in a data store “AMAGI-ID-DB”; and
iv. Transmitting “AMAGI-WMED-CONTENT” over one or more networks in one or more content formats;
b. Introducing a delay in the delivery of the network/stored playout content assets in order to identify the watermark and the fingerprint during network reception;
c. Performing in the “AMAGI-BARCODE-IN” module the steps of:
i. Monitoring the network/stored playout content assets at the time of reception to detect “AMAGI-WMED-CONTENT”;
ii. Upon detecting previously inserted watermarks (in step a) in “AMAGI-WMED-CONTENT”:
a) Extracting the corresponding “AMAGI-ID” from the network content asset;
b) Extracting corresponding fingerprints for the network content asset; and
c) Matching the extracted fingerprints obtained in step (b) with one or more previously stored “AMAGI-FP” in the “AMAGI-ID-DB”;
iii. Upon finding a match in step (c(ii)c):
a) Delivering the local content asset at the correct position in the stream of network/stored playout content assets, thereby replacing the network content asset with a suitable local content asset.
12. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 11 wherein:
a. One or more watermarking methods using spread-spectrum, amplitude modulation or using quantization of signals, can be used;
b. The watermark uses the network content asset as the carrier channel;
c. The watermark effects minimal distortion in the audio and video within the network content asset; and
d. The watermark is independent of the delivery networks, which might have different compression and/or decompression of the audio-video content, and yet the watermark is reliably transmitted.
13. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 11 wherein:
a. One or more fingerprinting methods using perceptual characteristics of audio or video streams within one or more network/stored playout content assets, can be used to extract a unique bit pattern that at best uniquely identifies the network/stored playout content assets; and
b. Fingerprinting methods used utilize a hash function or a low complexity compression of audio and or video for the purpose of unique identification of the content, rather than near-exact reproduction.
14. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 11 claim 11 wherein if the fingerprint extracted in step g(ii) match with a unique “AMAGI-FP” in “AMAGI-ID-DB”, it is used along with the previously inserted watermark detected to identify the position of insertion of local content assets in the stream of network/stored playout content assets and the steps of watermarking and fingerprinting can be performed interchangably.
15. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 11 wherein the delay introduced in step 1(e) is determined as the maximum value between:
a. The duration elapsing during extraction of the audio-video watermark; and
b. The duration elapsing during matching the extracted fingerprint.
16. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 11 wherein watermark is extracted from both audio and video of the network content asset or from either one of audio or video to identify the existence of “AMAGI-ID” watermark.
17. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 11 wherein the fingerprint matching is exercised across −5 to +10 video frames, from the position of identification of the watermark in order to obtain the exact position of the point of insertion.
18. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 11 wherein one or more fingerprint matching procedures can be used to identify different audio and video insertion points.
19. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 11 wherein either one of audio or video fingerprint matching and use the point of insertion identified for either audio or video to estimate the point of insertion for both audio and video.
20. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 11 wherein “AMAGI-BARCODE-IN” can be either of a standalone module or integrated as part of one or more pre-existing content play out, content editing, content post-production or content encoding devices.
21. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 11 wherein “AMAGI-BARCODE-OUT” can be either of a standalone module or integrated as part of one or more pre-existing content receiver, content insertion, content decoder, or content rendering devices.
22. A method for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 1 used in Internet content transmission networks by using “AMAGI-BARCODE-IN” on selected internet content assets during content creation, such that “AMAGI-BARCODE-OUT” is used to identify the watermarked internet content asset along with the point of insertion during unicast or multicast streaming to viewers.
23. A system for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker used in cable head-ends, set-top boxes, internet browsers and internet content delivery networks using audio-video fingerprinting and water marking with (a) “AMAGI-BARCODE-IN” subsystem, (b) an entertainment content play out server, (c) an ad content play out server, (d) traffic-billing system, (e) one or more encoder or decoder, (f) “AMAGI-BARCODE-OUT” subsystem and (g) cable and satellite TV plants configured to support a plurality of networks, in a plurality of content formats, wherein:
a. The “AMAGI-BARCODE-IN” subsystem is configured to:
i. Embed a unique network content asset identifier “AMAGI-ID” as an audio-video watermark, into the network content asset, the watermarked network content asset being labeled “AMAGI-WMED-CONTENT”;
ii. Extract a unique audio-video fingerprint “AMAGI-FP” from the network content asset;
iii. Store “AMAGI-ID” and “AMAGI-FP” in a data store “AMAGI-ID-DB”;
iv. Transmit “AMAGI-WMED-CONTENT” over one or more networks in one or more content formats;
b. A delay is introduced in the delivery of the network/stored playout content assets in order to identify the watermark and the fingerprint during network reception;
c. The “AMAGI-BARCODE-IN” subsystem is configured to:
i. Monitor the network/stored playout content assets at the time of reception to detect “AMAGI-WMED-CONTENT”;
ii. Upon detecting previously inserted watermarks (in step a) in “AMAGI-WMED-CONTENT”:
a) Extract the corresponding “AMAGI-ID” from the network content asset;
b) Extract corresponding fingerprints for the network content asset; and
c) Match the extracted fingerprints obtained in step (b) with one or more previously stored “AMAGI-FP” in the “AMAGI-ID-DB”;
iii. Upon finding a match in (c (ii) c) the local content asset is delivered at the correct position in the stream of network/stored playout content assets, thereby replacing the network content asset with a suitable local content asset.
24. A system for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 23 wherein:
a. One or more watermarking methods using spread-spectrum, amplitude modulation or using quantization of signals, can be used;
b. The watermark uses the network content asset as the carrier channel;
c. The watermark effects minimal distortion in the audio and video within the network content asset; and
d. The watermark is independent of the delivery networks; which might have different compression and/or decompression of the audio-video content, and yet the watermark is reliably transmitted.
25. A system for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 23 wherein:
a. One or more fingerprinting methods using perceptual characteristics of audio or video streams within one or more network/stored playout content assets, can be used to extract a unique bit pattern that at best uniquely identifies the network/stored playout content assets; and
b. Fingerprinting methods used utilize a hash function or a low complexity compression of audio and or video for the purpose of unique identification of the content, rather than near-exact reproduction.
26. A system for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 23 claim 11 wherein if the fingerprint extracted in step g(ii) match with a unique “AMAGI-FP” in “AMAGI-ID-DB”, it is used along with the previously inserted watermark detected to identify the position of insertion of local content assets in the stream of network/stored playout content assets and the steps of watermarking and fingerprinting can be performed interchangably.
27. A system for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 23 wherein the delay introduced is determined as the maximum value between:
a. The duration elapsing during extraction of the audio-video watermark; and
b. The duration elapsing during matching the extracted fingerprint.
28. A system for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 23 wherein the watermark is extracted from both audio and video of the network content asset or from either one of audio or video to identify the existence of “AMAGI-ID” watermark.
29. A system for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 23 wherein:
a. The fingerprint matching is exercised across −5 to +10 video frames, from the position of identification of the watermark in order to obtain the exact position of the point of insertion;
b. One or more fingerprint matching procedures are used to identify different audio and video insertion points; and
c. Either one of audio or video fingerprint matching and use the point of insertion identified for either audio or video to estimate the point of insertion for both audio and video.
30. A system for seamless content insertion of one or more local content assets, over one or more network/stored playout content assets using an in-band marker of claim 23 wherein:
a. The “AMAGI-BARCODE-IN” can be either of a standalone module or integrated as part of one or more pre-existing content play out, content editing, content post-production or content encoding devices; and
b. The “AMAGI-BARCODE-OUT” can be either of a standalone module or integrated as part of one or more pre-existing content receiver, content insertion, content decoder, or content rendering devices.
US14/131,504 2011-07-12 2012-07-12 System and method for seamless content insertion on network content using audio-video fingerprinting and watermarking Abandoned US20140150021A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US14/131,504 US20140150021A1 (en) 2011-07-12 2012-07-12 System and method for seamless content insertion on network content using audio-video fingerprinting and watermarking

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201161506853P 2011-07-12 2011-07-12
US14/131,504 US20140150021A1 (en) 2011-07-12 2012-07-12 System and method for seamless content insertion on network content using audio-video fingerprinting and watermarking
PCT/IN2012/000493 WO2013008255A2 (en) 2011-07-12 2012-07-12 System and method for seamless content insertion on network content using audio-video fingerprinting and watermarking

Publications (1)

Publication Number Publication Date
US20140150021A1 true US20140150021A1 (en) 2014-05-29

Family

ID=47506640

Family Applications (2)

Application Number Title Priority Date Filing Date
US13/544,234 Active 2032-12-27 US8737679B2 (en) 2011-07-12 2012-07-09 System and method for seamless content insertion on network content using audio-video fingerprinting and watermarking
US14/131,504 Abandoned US20140150021A1 (en) 2011-07-12 2012-07-12 System and method for seamless content insertion on network content using audio-video fingerprinting and watermarking

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US13/544,234 Active 2032-12-27 US8737679B2 (en) 2011-07-12 2012-07-09 System and method for seamless content insertion on network content using audio-video fingerprinting and watermarking

Country Status (2)

Country Link
US (2) US8737679B2 (en)
WO (1) WO2013008255A2 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150281776A1 (en) * 2014-03-31 2015-10-01 Samarth Desai System and method for targeted advertising
WO2020139608A1 (en) * 2018-12-26 2020-07-02 The Nielsen Company (Us), Llc Methods and apparatus to optimize reference signature matching using watermark matching
US10713495B2 (en) 2018-03-13 2020-07-14 Adobe Inc. Video signatures based on image feature extraction
WO2021050376A1 (en) * 2019-09-13 2021-03-18 The Nielsen Company (Us), Llc Use of in-band metadata as basis to access reference fingerprints to facilitate content-related action
US11259058B2 (en) * 2019-03-25 2022-02-22 Apple Inc. Use of rendered media to assess delays in media distribution systems
WO2023278251A1 (en) * 2021-06-30 2023-01-05 The Nielsen Company (Us), Llc Methods and apparatus to credit unidentified media
US20230129395A1 (en) * 2021-10-26 2023-04-27 Apple Inc. Synchronized playback of media content

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105843968A (en) * 2010-05-04 2016-08-10 沙扎姆娱乐有限公司 Methods for syschronizing media
ITTO20110723A1 (en) * 2011-08-03 2013-02-04 Csp A Innovazione Nelle Ict Scarl METHODS AND APPLIANCES FOR TRANSMITTING AND RECEIVING MULTIMEDIA CONTENT
EP2595399A1 (en) * 2011-11-16 2013-05-22 Thomson Licensing Method of digital content version switching and corresponding device
US10158927B1 (en) * 2012-09-05 2018-12-18 Google Llc Systems and methods for detecting audio-video synchronization using timestamps
FR3007614B1 (en) * 2013-06-21 2015-06-26 France Brevets DEVICE AND METHOD FOR COMPOSITION OF PROGRAMS OF DIFFERENT SOURCES IN BASE BAND
EP2876890A1 (en) 2013-11-21 2015-05-27 Thomson Licensing Method and apparatus for frame accurate synchronization of video streams
US9621963B2 (en) 2014-01-28 2017-04-11 Dolby Laboratories Licensing Corporation Enabling delivery and synchronization of auxiliary content associated with multimedia data using essence-and-version identifier
CN105451034B (en) * 2015-12-04 2019-08-27 优酷网络技术(北京)有限公司 A kind of advertisement control method for playing back and system
US10707980B2 (en) * 2016-05-06 2020-07-07 Iheartmedia Management Services, Inc. Substituting streaming station for over-the-air broadcast
CN105933741A (en) * 2016-05-16 2016-09-07 乐视控股(北京)有限公司 Advertisement playing method and device, and advertisement putting method, device and system
US10932011B2 (en) 2016-08-01 2021-02-23 Iheartmedia Management Services, Inc. Media substitution triggered by broadcast watermark
TWI802797B (en) 2019-05-10 2023-05-21 美商六科股份有限公司 Method for detecting and responding to a fingerprint mismatch detected after a previously detected fingerprint match, non-transitory computer-readable storage medium, and computing system
WO2020231927A1 (en) 2019-05-10 2020-11-19 The Nielsen Company (Us), Llc Content-modification system with responsive transmission of reference fingerprint data feature
WO2020231813A1 (en) 2019-05-10 2020-11-19 The Nielsen Company (Us), Llc Content-modification system with responsive transmission of reference fingerprint data feature
US11234050B2 (en) * 2019-06-18 2022-01-25 Roku, Inc. Use of steganographically-encoded data as basis to control dynamic content modification as to at least one modifiable-content segment identified based on fingerprint analysis
CN115309920A (en) * 2022-10-08 2022-11-08 国家广播电视总局信息中心 Audio and video management method and system based on fusion big data

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060184960A1 (en) * 2005-02-14 2006-08-17 Universal Music Group, Inc. Method and system for enabling commerce from broadcast content
US20070157224A1 (en) * 2005-12-23 2007-07-05 Jean-Francois Pouliot Method and system for automated auditing of advertising
US20070220544A1 (en) * 2005-09-20 2007-09-20 Jon Nash-Putnam Method, system and program product for broadcast advertising and other broadcast content performance verification utilizing digital artifacts
US20080068500A1 (en) * 2006-09-18 2008-03-20 Rgb Networks, Inc. Methods, apparatus, and systems for insertion of overlay content into a video signal with transrating capabilities
US20090187941A1 (en) * 2008-01-21 2009-07-23 Att Knowledge Ventures L.P. System and method for targeted advertising
US7930716B2 (en) * 2002-12-31 2011-04-19 Actv Inc. Techniques for reinsertion of local market advertising in digital video from a bypass source
US7934230B2 (en) * 2007-05-04 2011-04-26 Alcatel Lucent IPTV architecture for dynamic commercial insertion
US20110145857A1 (en) * 2009-12-16 2011-06-16 Microsoft Corporation Scalable advertising system for dynamically inserting advertisements
US20120143660A1 (en) * 2008-06-26 2012-06-07 Thomson Licensing Interval based ad insertion used for the delivery of video streams
US8635640B2 (en) * 2008-12-24 2014-01-21 At&T Intellectual Property I, Lp System, method and computer program product for verifying triggers in a video data stream
US8924995B2 (en) * 2004-07-23 2014-12-30 The Nielsen Company (Us), Llc Methods and apparatus for monitoring the insertion of local media content into a program stream
US20150106845A1 (en) * 2006-11-23 2015-04-16 Mirriad Limited Process and apparatus for advertising component placement
US20150131971A1 (en) * 2009-12-12 2015-05-14 At&T Intellectual Property, Lp System, method and computer program product for updating advertising data for recorded video data

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4974085A (en) * 1989-05-02 1990-11-27 Bases Burke Institute, Inc. Television signal substitution
US6771316B1 (en) * 1996-11-01 2004-08-03 Jerry Iggulden Method and apparatus for selectively altering a televised video signal in real-time
US6493872B1 (en) * 1998-09-16 2002-12-10 Innovatv Method and apparatus for synchronous presentation of video and audio transmissions and their interactive enhancement streams for TV and internet environments
US6959143B2 (en) * 2000-12-26 2005-10-25 Eloda Inc. Advertising extracting system
WO2003062960A2 (en) * 2002-01-22 2003-07-31 Digimarc Corporation Digital watermarking and fingerprinting including symchronization, layering, version control, and compressed embedding
US20050015816A1 (en) 2002-10-29 2005-01-20 Actv, Inc System and method of providing triggered event commands via digital program insertion splicing
US8020000B2 (en) * 2003-07-11 2011-09-13 Gracenote, Inc. Method and device for generating and detecting a fingerprint functioning as a trigger marker in a multimedia signal
US8032754B2 (en) * 2004-01-09 2011-10-04 Microsoft Corporation Systems and methods for embedding media forensic identification markings
US20060075449A1 (en) 2004-09-24 2006-04-06 Cisco Technology, Inc. Distributed architecture for digital program insertion in video streams delivered over packet networks
US8600113B2 (en) * 2004-11-12 2013-12-03 The University Court Of The University Of St. Andrews System, method and computer program product for video fingerprinting
US20060253864A1 (en) 2005-03-15 2006-11-09 Optical Entertainment Network, Inc. System and method for household-targeted advertising
US11004090B2 (en) * 2005-12-24 2021-05-11 Rich Media Club, Llc System and method for creation, distribution and tracking of advertising via electronic networks
DE602006017001D1 (en) * 2006-06-29 2010-10-28 Thomson Licensing SYSTEM AND METHOD FOR OBJECT-ORIENTED FINGERPRINTS OF DIGITAL VIDEOS
WO2010001410A2 (en) * 2008-06-09 2010-01-07 Amagi Technologies Private Limited Content independent advertisement delivery platform for digital tv
US20120167133A1 (en) * 2010-12-23 2012-06-28 Carroll John W Dynamic content insertion using content signatures
US20130031577A1 (en) * 2011-07-25 2013-01-31 Pixley Christopher R System and method for selling any audio or video advertising and recording the ad spot, collecting payment and delivering by email to broadcast media using software and applications utilized through computing or telecommunications devices
US8639178B2 (en) * 2011-08-30 2014-01-28 Clear Channel Management Sevices, Inc. Broadcast source identification based on matching broadcast signal fingerprints

Patent Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7930716B2 (en) * 2002-12-31 2011-04-19 Actv Inc. Techniques for reinsertion of local market advertising in digital video from a bypass source
US8924995B2 (en) * 2004-07-23 2014-12-30 The Nielsen Company (Us), Llc Methods and apparatus for monitoring the insertion of local media content into a program stream
US20060184960A1 (en) * 2005-02-14 2006-08-17 Universal Music Group, Inc. Method and system for enabling commerce from broadcast content
US20070220544A1 (en) * 2005-09-20 2007-09-20 Jon Nash-Putnam Method, system and program product for broadcast advertising and other broadcast content performance verification utilizing digital artifacts
US20070157224A1 (en) * 2005-12-23 2007-07-05 Jean-Francois Pouliot Method and system for automated auditing of advertising
US20080068500A1 (en) * 2006-09-18 2008-03-20 Rgb Networks, Inc. Methods, apparatus, and systems for insertion of overlay content into a video signal with transrating capabilities
US7688889B2 (en) * 2006-09-18 2010-03-30 Rgb Networks, Inc. Methods, apparatus, and systems for insertion of overlay content into a video signal with transrating capabilities
US20150106845A1 (en) * 2006-11-23 2015-04-16 Mirriad Limited Process and apparatus for advertising component placement
US7934230B2 (en) * 2007-05-04 2011-04-26 Alcatel Lucent IPTV architecture for dynamic commercial insertion
US20090187941A1 (en) * 2008-01-21 2009-07-23 Att Knowledge Ventures L.P. System and method for targeted advertising
US20120143660A1 (en) * 2008-06-26 2012-06-07 Thomson Licensing Interval based ad insertion used for the delivery of video streams
US8635640B2 (en) * 2008-12-24 2014-01-21 At&T Intellectual Property I, Lp System, method and computer program product for verifying triggers in a video data stream
US20150131971A1 (en) * 2009-12-12 2015-05-14 At&T Intellectual Property, Lp System, method and computer program product for updating advertising data for recorded video data
US20110145857A1 (en) * 2009-12-16 2011-06-16 Microsoft Corporation Scalable advertising system for dynamically inserting advertisements

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150281776A1 (en) * 2014-03-31 2015-10-01 Samarth Desai System and method for targeted advertising
US9451325B2 (en) * 2014-03-31 2016-09-20 Samarth Desai System and method for targeted advertising
US10713495B2 (en) 2018-03-13 2020-07-14 Adobe Inc. Video signatures based on image feature extraction
WO2020139608A1 (en) * 2018-12-26 2020-07-02 The Nielsen Company (Us), Llc Methods and apparatus to optimize reference signature matching using watermark matching
US10863236B2 (en) 2018-12-26 2020-12-08 The Nielsen Company (Us), Llc Methods and apparatus to optimize reference signature matching using watermark matching
US11438649B2 (en) * 2018-12-26 2022-09-06 The Nielsen Company (Us), Llc Methods and apparatus to optimize reference signature matching using watermark matching
US11259058B2 (en) * 2019-03-25 2022-02-22 Apple Inc. Use of rendered media to assess delays in media distribution systems
WO2021050376A1 (en) * 2019-09-13 2021-03-18 The Nielsen Company (Us), Llc Use of in-band metadata as basis to access reference fingerprints to facilitate content-related action
US11792254B2 (en) 2019-09-13 2023-10-17 Roku, Inc. Use of in-band metadata as basis to access reference fingerprints to facilitate content-related action
WO2023278251A1 (en) * 2021-06-30 2023-01-05 The Nielsen Company (Us), Llc Methods and apparatus to credit unidentified media
US20230129395A1 (en) * 2021-10-26 2023-04-27 Apple Inc. Synchronized playback of media content
US11831943B2 (en) * 2021-10-26 2023-11-28 Apple Inc. Synchronized playback of media content

Also Published As

Publication number Publication date
US20130177199A1 (en) 2013-07-11
WO2013008255A3 (en) 2013-04-11
US8737679B2 (en) 2014-05-27
WO2013008255A2 (en) 2013-01-17

Similar Documents

Publication Publication Date Title
US8737679B2 (en) System and method for seamless content insertion on network content using audio-video fingerprinting and watermarking
US10178443B2 (en) Enhanced metadata and content delivery using watermarks
US9197946B2 (en) System and method for automatic content program discovery
US20190356965A1 (en) Service signaling recovery for multimedia content using embedded watermarks
US8189859B2 (en) Apparatus for inserting watermark and method therefor
US10462536B2 (en) System for low-latency detection of known audio video content using audio fingerprinting and audio watermarking
US8978060B2 (en) Systems, methods, and media for presenting advertisements
US11653044B2 (en) Dynamic watermarking of media based on transport-stream metadata, to facilitate action by downstream entity
US11895361B2 (en) Automatic content recognition and verification in a broadcast chain
US9641877B2 (en) System and method for conveying session information for use in forensic watermarking
US20220150601A1 (en) Use of Watermarking to Trigger Fingerprint-Related Action
TWI744947B (en) Method, media client and computer-readable mediumfor use of watermarking to trigger fingerprint-related action

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO PAY ISSUE FEE